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”By and large it is uniformly true that in mathematics that 
there is a time lapse between a mathematical discovery and the 
moment it becomes useful; and that this lapse can be anything 
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whole system seems to function without any direction, without 
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” There are some things which cannot 

be learned quickly, and time, which is all we have, 
must be paid heavily for their acquiring. 

They are the very simplest things, 

and because it takes a man’s life to know them 

the little new that each man gets from life 

is very costly and the only heritage he has to leave.” 


Ernest Hemingway 
(From A. E. Hotchner, PAPA HEMMINGWAY, Random House, 
NY, 1966) 
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Chapter 0 


Introduction 


”The advantage is that mathematics is a field in which one’s 
blunders tend to show very clearly and can be corrected or erased 
with a stroke of the pencil. It is a field which has often been 
compared with chess, but differs from the latter in that it is only 
one’s best moments that count and not one’s worst.” 

Norbert Wiener 

EX-PRODIGY: MY CHILDHOOD AND YOUTH 


Groups measure symmetry. No where is this more evident than in the 
study of symmetry in 2- and 3-dimensional geometric figures. Symmetry, 
and hence groups, play a key role in the study of crystallography, elemen- 
tary particle physics, coding theory, campanology (see §5.11 below), and the 
Rubik’ s cube, to name just a few. 

This is a book biased towards learning group theory not learning to solve 
*the cube”. To paraphrase the German mathematician David Hilbert, the 
art of doing group theory is to pick a good example to learn from. The 
Rubik’s cube will be our example. We motivate the study of groups by 
creating a group-theoretical model of Rubik’s cube-like puzzles. Although 
some solution strategies are discussed (for the 15 puzzle, the Rubik’s cube - 
the 3x3x3 and 4x4x4 versions, the ” Rubik tetrahedron” or pyraminx, the 
”Rubik dodecahedron” or megaminx, the skewb, square 1, the masterball, 
and the equator puzzle), these are viewed more abstractly than most other 
books on the subject. We regard a solution strategy merely as a reasonably 
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efficient algorithm for constructing any element in the associated group of 
moves. 

The approach here is different than some other group theory presentations 
(such as Rotman [R]) in that 


(a) we emphasize puzzle-related group theoretical examples over general 
theory, 


(b) we present some of the basic notions algorithmically (as in [Bu]), and 


(c) we tried to keep the level as low as possible for as long as possible 
(though only the reader can judge if we’ve keep it low enough long 
enough, or too low too long!). 


The hope is that by following along and doing as many of the exercises as 
possible the reader will have fun learning about how groups can be used to 
solve a real life” problem (assuming you consider solving the Rubik’s cube 
a problem from real life!). Along the way, we shall also explain the rules of a 
two person game (the Gordon game) which arises from group theory - a game 
which arises from a group not a group which arise from a game. See also the 
superflip game introduced in §4.5.2 (related to coin-turning games of H. W. 
Lenstra) and W. Hamilton’s Icosian game in §6.2.1. This is probably one of 
the few mathematics texts which contains an exercise of the form: “Play a 
game!”. 

Experience has shown that in order to keep up a pace which allows the 
class to finish the proof of the ”fundamental theorems of cube theory” (see 
§4.5.2 and §10.2) in one semester, the students should either (a) be candidates 
for the honors program or (b) have some group-theoretical background. More 
modest goals might be more appropriate for classes with less background. 

Acknowledgements: Many of the graphs given below were produced with 
the help of MAPLE. Some of the group-theoretical calculations (such as 
the order of a group element) were determined with the help of GAP. The 
contributions from students were a big help. I especially thank my former 
students Gen Gomes, Ann Luers, Jim McShea, Justin Montague, and Spencer 
Robinson for their help and collaboration on some of the topics. I’d also like 
to thank Andy Southern for collaborating on some of the masterball material 
[JS] which appears in chapter 14 and Dennis Spellman for the argument in 
89.7 where a presentation of wreath products is derived. 


Chapter 1 


Logic and sets 


"Tf logic is the hygiene of the mathematician, it is not his 
source of food; the great problems furnish the daily bread on 
which he thrives.” 

Andre Weil 

*The future of mathematics”, AMER MATH MONTHLY, 
May, 1950 


” The rules of logic are to mathematics what those of structure 
are to architecture.” 

Bertrand Russell 

MYSTICISM AND LOGIC AND OTHER ESSAYS, 1917, 
pol 


This chapter will present some background to make some of the termi- 
nology and notation introduced later a little clearer. It is not intended to be 
a rigorous introduction to mathematical logic. 


1.1 Logic 


A statement is a logical assertion which is either true or false. (Of course we 
assume that this admitedly circular definition’ is a statement.) Sometimes 
the truth or falsity of a statement is called its Boolean value. One can com- 
bine several statements into a single statement using the connectives ’and’, 
‘or’, and ‘implies’. The Boolean value of a statement is changed using the 
‘negation’. We shall also use ‘if and only if’? and “exclusive or’ but these can 
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be defined in terms of negation ~ and the other three connectives (V, A, and 
=). 


Exercise 1.1.1 Do this. In other words, express _V and <= > in terms of 
~,V,A, and=>. 


Notation: Let p and q be statements. 


statement notation terminology 
p and q pAq ” conjunction” 
porq pVvq ” disjunction” 
p implies q p>q ”conditional” 
negate p ~p *negation” 
p if and only if q p <= q| “if and only if” 
either p or q (not both) | p_Vq “exclusive or” 


Truth tables: Given the Boolean values of p,g, we can determine the 
values of pA gq, pVq,p>4,P = gq p_Aq using the truth tables: 


pP|}qd{|pAq P|qd/pVq 
T|/T| T T/T] T 
T|F) F TIF] T 
F/T] F F/T] T 
F/F | F F/F | F 
P|q/p>q pPlq|p—q 
T|T T T|T T 
T|F F T|F F 
FT T FT F 
F [EF T F [EF T 
P|}q|p-Vq 
p|~p 
T|T F 
T|F T T| F 
F/T T Fl] T 
F/ EF F 
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Exercise 1.1.2 (M. Gardner) Determine which of the following statements 
is true. 

e Exactly one of these statements is false. 

e Exactly two of these statements are false. 

e Exactly three of these statements are false. 

e Exactly four of these statements are false. 

e Exactly five of these statements are false. 

e Exactly six of these statements are false. 

e Exactly seven of these statements are false. 

e Exactly eight of these statements are false. 

e Exactly nine of these statements are false. 

e Exactly ten of these statements are false. 


Definition 1 Let B be the set {0,1} with the two operations addition (+) 
and multipication (*) defined by the following tables 


pla\|pta pla|p*q 
iii! 0 7/i| 1 
1/0) 1 110\ 0 
go; 1) 1 0\1\) 0 
010] 0 0'0\ 0 


(Note how these mimic the truth tables of ‘exclusive or’ (_V) and ‘and’ 
(A).) We call B the Boolean algebra. 


Exercise 1.1.3 Use truth tables to verify DeMorgan’s laws: 
(a) pN(qVr) => (pAQV(PAr), 
(b) pV (qAr) => (PV Q)A(pVr), 
and the laws of negation: 
(c)~ (pAqg) = (~P)V(~q) | 
(d) ~ (pV q) => (~p)A(~ Q). 
(You may want to do (a), (c), (d) first, then deduce (b) from these.) 
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Definition 2 A logical argument is a sequence of statements (called hypotheses) 


P15 P25++5Pn, which imply a statement q (called the conclusion). 
In other words, a logical argument is a true statement of the form 
(p1 Apo A... A Pn) > 4. 
Exercise 1.1.4 Use truth tables to verify the logical argument 
(p> aAq>r))>(p>r). 


Definition 3 ‘For all‘, written V, is the universal quantifier. ‘There exists‘, 


written J, is the existential quantifier. 
A variable is a letter denoting some (possibly unknown) object. A constant 
is a letter denoting some specific, well-defined object. A term is a variable or 


constant, 
A predicate is a function 


P: {terms} > {logical statements}. 


Example 4 ‘Daffy Duck‘ is a constant. ‘x and Daffy Duck are cartoon 
characters‘ is a predicate involving a variable and a constant. 


1.1.1 Expressing an everyday sentence symbolically 


When creating a model of the Rubik’s cube, we shall of course need to con- 
vert some ‘everyday statements’ into symbolical form in order to perform 
mathematical analysis. Let’s illustrate this with an example. 


Example 5 Consider the statement ”Each student in this class is a mathe- 
matics major”. Let 


M(a) = x is a mathematics major, S(x) =x is a student in this class. 


The symbolic form is 


Va, S(x) => M(x). 


Exercise 1.1.5 Convert “Someone in this class likes the Rubik’s cube” to 
symbolic form. 
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Exercise 1.1.6 (M. Gardner [Gar2]) Professor White, Professor Brown and 
Professor Black were lunching together. “Isn't it remarkable”, said the lady, 
“that our names are White, Black, and Brown and one of us has black hair, 
one has brown hair, and one has white hair.” 

*It is indeed”, answered the one with the black hair as Professor Black 
bit into his sandwich, “and have you noticed that not one has hair color to 
match our name?” 

The lady’s hair is not brown. What is the color of Professor Black’s hair? 


white | brown | black | lady 
White 
Brown 
Black 
lady x 


1.2 Sets 


A set is a ’well-defined’ collection of objects. The objects in a set are the 
elements of the set. 

There are two common ways to describe a set: 

(a) list all its elements (if the set is finite, e.g., {1,2,3}), 

(b) describe the set using properties of its elements (e.g., the set of all 
even integers can be described by {n | n an integer, 2 divides n}). 


Remark 1 We must be a little careful when describing sets using properties 
since some ‘self-referential’ properties lead to contraditions: let 


R={«a|«é«}. 


In other words, for allza, xe R = > x € x. In particular, if we takex = R 
then this becomes 


Rek = R€R, 


an obvious contradition. The problem is that R is not “well-defined” (in the 
sense that it does not satisfy the set-theoretic axioms which we will skip here). 


The empty set is the set containing no elements, denoted 9. 
Notation: Let S and T be sets. Assume that S CX. 
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statement notation terminology 
set of elements in S and T SnNT intersection 
set of elements in S or T’ SUT union 
set of elements in S or T (not both) SAT | symmetric difference 
set of elements not in S' Se complement 
Sis a subset of 7 SCT subset 


Exercise 1.2.1 Use Venn diagrams to verify the DeMorgan laws: 
(a) SA(PUU)=(SNT)U(SNYV) , 
(b) SU(TAU)=(SUTJN(SUTU) , 
and the laws of negation: 
(c) (SOT) =S°UT®, 
(d) (SUT) = S°n T°. 


Definition 6 We call two sets S,T disjoint if they have no elements in com- 
mon, i.e., if 


SAT=9. 
If 
S=ULSi, 
where the S,,5o,...,5, are pairwise disjoint sets then we call this union a 


partition of S. 
Example 7 If 
S=Z, S, = {...,-2,0,2,...} = even integers, 


Sy = {...,-3,-1,1,3,...} = odd integers, 


then S = S$, US» ts a partition of the integers into the set of even and odd 
ones. 


Logic/set theory analogs: Just as one can use connectives to form new 
statements from old statements, there are analogous ways to form new sets 
from old ones using ‘intersection’ (the analog of ’and’), ‘union’ (the analog 
of ’or’), and complement’ (the analog of ’negation’). The analog of implies’ 
is ‘subset’. 


1.2. SETS 


set theory logic 
sets statements 
union or 
intersection and 
subset implies 
symmetric difference | exclusive or 
equal if and only if 
Venn diagrams truth tables 


For more on logic or set theory, see for example [C] or [St]. 
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LOGIC AND SETS 


Chapter 2 


Functions, matrices, relations 
and counting 


”T think mathematics is a vast territory. The outskirts of 
mathematics are the outskirts of mathematical civilization. There 
are certain subjects that people learn about and gather together. 
Then there is a sort of inevitable development in those fields. You 
get to a point where a certain theorem is bound to be proved, 
independent of any particular individual, because it is just in the 
path of development.” 

Wiliam P. Thurston 

MORE MATHEMATICAL PEOPLE, NY, 1990, p332 


”Chance favors the prepared mind.” 


Louis Pasteur 


This chapter will introduce some frequently used notions. 


2.1 Functions 


Let S and T be finite sets. 


Definition 8 : A function f from S to T is a rule which associates to each 


element s © S exactly one elementt € T. We will use the following notations 


21 


22CHAPTER 2. FUNCTIONS, MATRICES, RELATIONS AND COUNTING 


for this: 
f:S—4T (fis a function from $ to T”), 
f:seoet (fsendssin S tot in T”), 
t= f(s) (tis the image of s under f”). 


A function is also called a map, mapping, or transformation. We call S 
the domain of f, 7’ the range of f, and the set 


f(S) ={f(s) eT | s€ S} 
the image of f. The Venn diagram depicting this setup is(see Fig 2.1): 


Figure 2.1: Venn diagram of f: S$ T 


The Cartesian product of two sets S, Tis the set of pairs of elements 
taken from these sets: 


Sx T= {(s,t)|seS,teT}. 


Example 9 /fR denotes the set of all real numbers then the Cartesian prod- 
uct R x R is simply the plane of pairs of real numbers we are all familiar 
with. 

More generally, we may iterate this process and take the Cartesian product 
the set of real numbers with itself n times (where n > 1 is any integer) to 
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get Rx... x R (n times). This n-fold Cartesian product is denoted more 
conveniently by IR". An element of the set R" will be called a vector or an 
n-vector to be specific. 


The graph of a function f : S — T is the subset 
{(s, f(s)|sEeS}CSxT. 


It is not possible for every subset of S x T the graph of some function 
from S to T’. The following fact classifies exactly which subsets of S x T can 
arise as the graph of a function from S to T. 


Lemma 10 : Let X CS xT. X is the graph of a function from S to T if 
and only if, for all (s,,t,) € X and (s9,t2) € X, whenever ty # ty we also 
have 81 # 89. 


Exercise 2.1.1 Verify this. (Hint: Let 


pry SX TOS 


(s,t) > s 
be projection onto the 1st component. Recall that the graph of a function has 
the property that pr{'(s) is always a singleton.) 


Definition 11 /f the image of the function f : S — T is all of T, ie., if 
f(S)=T, then we call f surjective (or “onto”, or “is a surjection”). 


Equivalently, a function f from $ to T is surjective if each t in T is the 
image of some s in § under f. Occasionally, you see the following special 
notation for surjective functions: 


f:S—>T  (f”maps S surjectively onto 7”). 


Exercise 2.1.2 State a general rule which determines those subsets X of 
Sx T which are the graph of some surjective function from S toT. (Use the 
projection onto the second component, 


pro: SX ToT 
(s,t) —t 


in your rule.) 
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Question: Suppose that |S] < |7'|. Is there a surjective function f : 
S — T? Explain. 


Definition 12 : A function f : S — T is called injective (or “one-to-one” 
or “an injection”) if each element t belonging to the image f(S) is the image 
of exactly one s in S. 


In other words, f is an injection if the condition f(s1) = f(s2) (for some 
81,82 € S) always forces s; = s2. Sometimes the “hook arrow” notation is 
used to denote an injective function 


f: ST. 


Question: Suppose that |.S| > |T|. Is there an injective function f : 5 > 
T? Explain. 


Exercise 2.1.3 Suppose that |S| = |T|. Show that a function f: S 4 T is 
surjective if and only if it is injective. 


Definition 13 A function f :.S — T is called a bijection if it is both injective 
and surjective. 


Equivalently, a bijection from $ to T is a function for which each t in T 
is the image of exactly one s in S. 


Exercise 2.1.4 Give an algorithm for determining if a given finite subset X 
of S x T is the graph of a bijection from S to T. 


Definition 14 A set S' is called countable if there exists a bijection f : S 
Z to the set of integers Z. 


Example 15 The set S of all rational numbers 0 <r <1 is countable since 
you can define f : S > Z@ as follows: f(1) = 1/2, f(2) = 1/3, f(3) = 2/3, 
/4) = 1/4, FG) = 3/4, (6) = 1/5, f(T) = 2/5, F(8) = 3/5, and so on. 
There are ¢(n) terms of the form m/n, where m is relatively prime ton and 
o(n) denotes the number of positive integers less than or equal to n which are 
relatively prime ton (i.€., have no common prime divisors). (b is sometimes 
called Euler’s phi function). 
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Exercise 2.1.5 Give an algorithm for determining if a given finite subset X 
of Sx T is the graph of a bijection from S to T. 


Example 16 Let C’ be the cube in 3-space having vertices at the points 
O= (000.0), (10, 0) a 0. OP (000.1 a = es 
(1,0,1), Ps = (0,1,1),P; = (1,1,1). We shall also (to use a notation which 
will be used more later) denote these by dbl, dfl, dbr, ubl, dfr, ufl, ubr, 
ufr, resp. Let Co = {O,Pi,..., Pz} be the set of the 8 vertices of C, let 
Cy = {uf,ur,ub, ul, fr, br, bl, fl, df,dr,db,dl} be the set of the 12 edges of 
C, and let Cp = {F,B,U,B,L,R} be the set of the 6 faces of C. Let r be 
the rotation of a point (x,y,z) by 180 degrees about the passing through the 
points (1/2,1/2,0), (1/2,1/2,1). Note that r: R°? > R® is a function which 
sends the cube C’ onto itself. 
The cube C' is pictured as: (see Fig 2.2): 


Figure 2.2: Cube in 3D space 


This function r induces three functions 
fo: Co > Co, feG, =, fo: Cy 7 Cp. 


where f; is the function which sends x € C; to its image r(x) under r (which 
is again in C;), fort =0,1,2. Each f; is a bijection. 
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Exercise 2.1.6 Finish the labeling of the vertices and the faces in the above 
picture and describe f, and fy explicitly by filling out the tables 


v | fo(e) fe} file) | fF | AW) 


Lemma 17 /f f : S — T is a bijection then there exists a function f~' : 
T > S defined by the following property: fors € S andt €T, we have 


f-'\()=s — if and only if f(s) =¢. 
Exercise 2.1.7 Prove this. 


Definition 18 The function f~' in the above lemma is called the inverse function 


of f. 


2.2 Functions on vectors 


This section presents a few basic facts about matrices, which we regard as 
a certain type of function which sends vectors to vectors. For further de- 
tails, see any text on linear algebra, for example [JN] where the historical 
introduction below is borrowed from. 


2.2.1 History 


The mathematician who first published a major work which seriously studied 
matrices and matrix algebra in the western world was Lord Arthur Cayley 
(1821-1895) of Cambridge, England. He wrote a memoir on the theory of 
linear transformations which was published in 1858 and is often thought of 
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as one of the “fathers” of matrix theory, though in fact it was his friend and 
colleague James Sylvester who first coined the term “matrix”. 

Here is one of the earliest examples which motivated Cayley: if we have 
three coordinate systems (2, y),(a’,y’), and (2”,y"”) , connected by 


es=auty 
ysun-y 
and 
gv! = —z' —y! 
y! = —a' +y' 
then the relationship between (x,y) and (#”,y”) is given 


oan ey =—(a+y)—(x—y) = —2a 
vty! =—(x@+y) +(e -y) = —2y 


If we do as Cayley did and abbreviate the three change of coordinates by the 
square array of the coefficients then we obtain the three arrays 


da type &]- 


Cayley noticed that there was an algebraic rule which allows us to determine 
the third array from the first two, without doing the substitution we did 
above to relate (x,y) to (a,y). This rule involves the rows of the first 
array and the columns of the second array. For example, to get the entry in 
the upper left-hand corner of the third array, Cayley explained we need to 
combine the first row of the first array with the first column of the second 
array as follows: —2 = 1-(—1)+1-(-—1). (The general formula will be given 
below.) To get the entry in the upper right-hand corner of the third array, 
we combine the first row of the first array with the second column of the 
second array: 0 = 1-(—1)+1-1. To get the entry in the lower left-hand 
corner of the third array, we combine the second row of the first array with 
the first column of the second array: 0 = 1 - (—1) + (—1)- (-1). Finally, to 
get the entry in the upper left-hand corner of the third array, we combine 
the first row of the first array with the first column of the second array: 
—2=1-(-1)+(-1)-1. The relationship which we just described betweeen 
the about three arrays he called ”matrix multiplication”. 
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2.2.2 3x3 matrices 
First, a 3 x 3 matrix is a 3 x 3 table of real numbers 


4141 412 413 
A= 421 422 423 


431 432 433 


which acts on R® by matrix multiplication: 


G41 412 413 x Gy1t + 12Y 1 4132 

4 
Av= G91 422 93 y = Gg1t + A224 1 932 
43, 432 433 z 4310 7 432Y + 433 


In general, any such 3 x 3 matrix gives rise to a function A: R® > R’. 


Example 19 /f A= J, the 3 x 3 identity matriz, 
1 0 0 
0 1 0 
00 1 

then 1,0 = 0 for all v EC R?. 


2.2.3. Matrix multiplication, inverses 


An m Xn matrix (of real numbers) is a rectangular array or table of numbers 
arranged with m rows and n columns. It is usually written: 


aii 412 Gin 

Gq, 422 G2n, 
A= 

Am Om2 oe Amn 


The (7,7) entry of A is a;;. The ith row of A is 
[ ait aig ct Gin | (1<i<m) 


The jth column of A is 
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A matrix having as many rows as it has columns (m = 7) is called a 
square matrix. The square n x n matrix 


i . 0 
0 

0 
0 0 1 


is called the n x n identity matrix and denoted / or J,. 


2.2.4 Muliplication and inverses 


You can multiply an m x n matrix A by an x p matrix B and get am x p 
matrix AB. The (7,7) entry of AB is computed as follows: 


1. Let k= 1 and c = 0. 


2. If k = m, you're done and a;; = ¢,. Otherwise proceed to the next 
step. 


3. Take the k entry of the 7 row of A and multiply it by the k entry 
of the j* row of B. Let cy = cg_1 + aiKDK;- 


4. Increment k by | and go to step 2. 


In other words, multiply each element of row 2 in A with the corresponding 
entry of column j in B, add them up, and put the result in the (2,7) position 
of the array for AB. 

If A is a square n Xn matrix and if there is a matrix B such that AB = [, 
then we call B the inverse matrix of A, denoted A7!. 


2.3. Relations 


A relation on a set is a generalization of the concept of a function from 5 to 
itself. 


Definition 20 ; Let S be a set. If R is a subset of S x S then we call R a 
relation on S. If (x,y) € R then we say that x is related to y. 
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We may also regard a relation R on S as a function 
R:Sx S— {0,1}. 


In this form, we say x is related to y (for x,y € S) if f(a,y) = 1. 

This is a very general notion. There are lots and lots of relations in 
mathematics - inequality symbols, functions, subset symbols are all common 
examples of relations. 


Example 21 Let S be any set and let f be a function from S to itself. This 
function gives rise to the relation R on S defined by the graph of f: 


R=({(x,yEeSxS|y=f(x), for re Sh}. 


(It is through this correspondence that we may regard a function as a rela- 
tion.) 


Example 22 Let S be the set of all subsets of {1,2,...,n}. Let R be defined 


by 
R = {(S1, So) | Sy Cc So, Sy € S, So € St. 


Note that R is a relation. 


Exercise 2.3.1 Find a relation corresponding to the symbol < on the real 
line. 


Definition 23 Let R be a relation ona set S. We call R an equivalence relation 


if 


a) any element s € S is related to itse reflexive 
(a) any el t S is related to itself (“reflexive”), 

(b) if s is related to t (i.e., (s,t) belongs to S) then t is related to s 
(“symmetry”), 

(c) if 8; is related to 582 and 82 is related to s3 then s, is related to 83 
(“transitivity”). 


Example 24 The equality symbol = provides an equivalence relation on the 
real line: let D = {(a,x) | x real}. This is an equivalence relation on the 
real line: note x =y if and only if (x,y) belongs to D. 


Notation: If R is an equivalence relation on S then we often write x ~ y 
or « = y in place of (x,y) € R, for simplicity. 
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Example 25 Fiz an integer n > 1. For integers x,y, define x = y if and 
only if n divides x —y. In this case, we say that x is congruent to y mod 
n. The equivalence class of x is sometimes (for historical reasons) called 
the residue class (or congruence class) of x mod n. This notation was first 
introduced by Gauss | 


Exercise 2.3.2 (a) Let f(x) = x”, let S be the real line, and let R be the 
corresponding relation as in the first example. Is R an equivalence relation? 

(b) Let f(x) = 2a, let S' be the real line, and let R be the corresponding 
relation as in the first example. Is R an equivalence relation? 


Exercise 2.3.3 Let R be the corresponding relation as in the second exam- 
ple. Is R an equivalence relation? 
Let R be an equivalence relation on a set S. Fors € S, we call the subset 


[s]={teS| sno 


the equivalence class of s in S. 


Example 26 For integers x,y, define x = y if and only if 3 divides x — y. 
the equivalence classes are 


Be 


Exercise 2.3.4 Show that for any 8, and 52 in S, we have either 


(a) [si] = [s2], or 


(b) [s1] is disjoint from |s9]. 


As a consequence of this exercise, we see that if A is an equivalence 
relation on a set 5 then the equivalence classes of R partition S into disjoint 
subsets. We state this as a separate lemma for future reference (we also 
assume SS is finite for simplicity): 


1C. F. Gauss, 1777-1855, is regarded by many as one of the top mathematicians of all 
time. At the age of 21 he wrote ”Disquisitiones Arithmeticae”, which started a new era 
of number theory and introduced this notation. 
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Lemma 27 : /[f S is a finite set and R is an equivalence relation on S then 
there are subsets 
5, CS, S.C S,...,5,% CS, 
satisfying the following properties: 
(1) S is the union of all the S;’s: 


(2) the S;’s are disjoint: for 1 <i<k, 1 <j <k, oft 4 then $;05; =9. 
(These last two properties say that the S;’s partition S in the sense of the 
previous chapter.) 
(3) the S;’s exhaust the collection of equivalence classes of R: for each 
1<ic<k, there is ans; € S such that 


S; = [s;]. 


(This element s; is called a representative of the equivalence class S;.) 


Example 28 For real numbers x,y, define x = y if and only if x —y is an 
integer. The equivalence classes are of the form 

[tq] ={..,2@-—2,e¢-—la,rtl,et+2,...}, 
for x real. Each equivalence class has exactly one representative in the half 


open interval [0,1). 


Remark 2 Conversely, given a partition as in (1), there is an equivalence 
relation R such that S; = |s;], for some some 8; € S, where 


[s]={xeES|svx} 
is the equivalence class of s with respect to R. Indeed, we define 
R=UL,S; x Sj. 
This is an equivalence relation and s ~ t if and only if s,t € S;, for some 
@=1,2,...,k. 


Exercise 2.3.5 Let S be the set Z of all integers. Let R be the relation 
defined by (x,y) € R if and only if x — y is an even number (i.e., an integer 
multiple of 2). 

(1) Show that this is an equivalence relation, 

(2) Find the sets S; in the above lemma which partition S, 

(3) Find a representative of each equivalence class §;. 
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2.4 Counting 


This section quickly surveys the few basic counting principles we shall use 
later. 
Addition principle: Let 5;,...,.5,, denote disjoint finite sets. Then 


|S; U...US,; = [Si] +... + /S,]. 


Example 29 /[f there aren bowls, each containing some distinguishable mar- 
bles and if S; is the set of marbles in the i” bowl then the number of ways to 
pick a marble from exactly one of the bowls is |Si|+...+|S,|, by the addition 
principle, 


Corollary 30 (Pigeonhole principle) If there aren objects (pigeons) which 
must be placed in m (md <n) boxes (pigeonholes) then there is at least one 
box with at least d+ 1 objects. 


Example 31 /f you are in a room with 9 others then there must be either at 
least 5 people you know or 5 people you don’t know (not counting yourself). 
In this case, there are n = 9 objects and m = 2 boxes (the friend box and the 
stranger box) so we may take d = 4 in the pigeonhole principle. 


Multiplication principle: Let Sj, ...,.5,, denote finite sets. Then 
|Sy x... XS, | = [Sy] +... [Sh]. 


Example 32 [f there aren bowls, each containing some distinguishable mar- 
bles and if 5; is the set of marbles in the i*” bowl then the number of ways 
to pick exactly one marble from each of the bowls is |Sy|-...+|S,|, by the 
multiplication principle. 


Corollary 33 The number of ordered selections, taken without repetition, of 
m objects from a set of n objects (m <n) is 


n! 
Corollary 34 The number of ordered selections, taken with repetition al- 
lowed, of m objects from a set of n objects (m <n) isn™. 
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Example 35 The number of n-tuples, without repetition, of objects from the 
set {1,2,...,n} is 
nb=n-(n—1)-..-1. 


Exercise 2.4.1 Let C be a set of 6 distinct colors. Fix a cube in space 
(imagine it sitting in front of you on a table). We call a coloring of the cube 
a choice of exactly one color per side. Let S be the set of all colorings of the 
cube. We say x,y € S are equivalent if x andy agree after a suitable rotation 
of the cube. 

(a) Show that this is an equivalence relation. 

(b) Count the number of equivalence classes in S. 


Chapter 3 


Permutations 


”What we have to learn to do, we learn by doing” 
Aristotle 
ETHICS 


”Mathematics, springing up from the soil of basic human ex- 
perience with numbers and data and space and motion, builds up 
a far-flung architectural structure composed of theorems which re- 
veal insights into the reasons behind appearences and of concepts 
which relate totally disparate concrete ideas.” 

Sanders MacLane 

AMER. MATH. MONTHLY, 1954 


Let J, = {1,2,...,n} be the set of integers from 1 to a fixed positive 
integer n. When n is fixed and there is no ambiguity sometimes we will 
simply write T for 7,,. A permutation of T is a bijection from T to itself. (A 
bijection was defined in Definition 13.) For example, on the 3 x 3 Rubik’s 
cube there are 9-6 = 54 facets. If you label them 1,2,...,54 (in any way 
you like) then any move of the Rubik’s cube corresponds to a permutation 
of 754. In this chapter we present some basic notation and properties of 
permutations. 

Notation: We may denote a permutation f: T — T by a2 x n array: 


J “(yy fa) fin) ) 
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Example 36 The identity permutation, denoted by I, is the permutation 
which doesn’t do anything: 


Definition 37 Let 


eM =#>t|fM>fG)}, Lsisn-l 


Let 
swap( f) = e¢(1) +... + ex(n — 1). 


We call this the swapping number (or length of the permutation f since it 
counts the number of times f swaps the inequality ini <j to f(t) > fly). 
If we plot a bar-graph of the function f then swap(f) counts the number of 
times the bar at i is higher than the bar at 3. We call f even if swap(f) is 
even and we call f odd otherwise. 


The number 
sign(f) = (-1)nn 


is called the sign of the permutation f. 


Example 38 Let n = 3, so T = {1,2,3}. We may describe the permutation 
f:T 3T for which f(1) = 2, f(2) =1,f(3) by a2 x 3 array 


1 2 3 
2 1 3 


or by a “crossing diagram”: 


3 -- 3 


The number of crosses in this diagram is the swapping number of f, from 
which we can see that the permutation is odd. 
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Exercise 3.0.2 Express f : T — T given by f(1) = 3, f(2) = 1, f(3) = 2, 
as (a4) a2 x3 array, (b) a crossing diagram. Find its swapping number and 
Sign. 


Definition 39 Let f : T — T andg :T — T be two permutations. We 
can compose them to get another permutation, the composition, denoted fq : 


ToT: 
tors ft) —> g(f(t)) 
T = T = T 


Notation We shall follow standard convention and write our composi- 
tions of permutations left-to-right. (This is of course in contrast to the 
right-to-left composition of functions you may have seen in calculus.) When 
a possible ambiguity may arise, we call this type of composition ”composi- 
tion as permutations” and call ” right-to-left composition” the ” composition 
as functions”. 

When f = g then we write ff as f?. In general, we write the n-fold 
composition f...f (n times) as f”. Every permutation f has the property 
that there is some integer N > 0, which depends on f, such that f% = 1. (In 
other words, if you repeatedly apply a permutation enough times you will 
eventually obtain the identity permutation.) 


Definition 40 The smallest integer N > 0 such that f’ = 1 is called the 
order of f. 


Example 41 Let T = {1,2,3} and let 


1 23 1 23 
r=(3 3) i=(; 3) 


We have 


Exercise 3.0.3 Compute (a) fg and (b) the order of f and the order of g, 


where 
123 123 
(a) r= (43 :) s=(; 3) 


:) 


oo 
ox 
—* 
eae 
I| 
———N 
QW re 
re bo 
NO Ww 
Ne 
I| 
———N 
NO FR 
re bo 
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Exercise 3.0.4 /f f,g,h are permutations of T, is (fg)h = f(gh)? Explain 
why. 


3.1 Inverses 


We can look at the graph of a function f : 7 — T and determine 

(a) if it is injective, 

(b) if it is surjective, 

(c) the inverse f~', if it exists. 
Indeed, from the graph of f we can determine the image f(7’) and this deter- 
mines if f is surjective or not. The inverse exists only if f is injective (hence, in 
our case, surjective by exercise 2.1.3). It’s graph is determined by reflecting 
the graph of f about the diagonal, x=y. 


Lemma 42 The following statements are equivalent: 
(1) f:T 4 T is injective, 
(2) f:T 4 T is surjective, 
(3) |F(L)| = [PI 


proof: The equivalence of the first two statements is by the exercise at 
the end of chapter 1. (2) is equivalent to (3), by definition of surjectivity. 


1 2 3 
3 1 2 
is obtained by reflecting the graph depicted in Fig 3.1 
about the x = y line (see Fig 3.2): 


Example 43 The inverse of 


Exercise 3.1.1 Graph and determine the inverses of the following permuta- 
tions: 
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qraph of permutation f:T > T 


Figure 3.1: 


Figure 3.2: 
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There are two more commonly used ways of expressing a permutation. 
The first is the ”matrix notation”: 


Definition 44 To a permutation f :T > T, given by 
fe ( 1 20. on ) 
FU) FQ) Flr) 
we associate to it the matrix P(f) of 0's and 1's defined as follows: the 1j-th 
entry of P(f) is 1 if7 = f(2) and is 0 otherwise. 
(A brief introduction to matrices is given in the appendix.) 


Definition 45 A square matrix which has exactly one 1 per row and per 
column (as P(f) does) is called a permutation matrix . 


Lemma 46 There are n! distinct n x n permutation matrices and there are 
n! distinct permutations of the set {1,2,...,n}. 


Exercise 3.1.2 Prove this using the multiplication counting principle from 
the section on counting in the previous chapter. 


Example 47 The matrix of the permutation f given by 


1 23 
r=(375) 


1s 


oOo cor 
re © 


Note that matrix multiplication gives 


0 1 0 1 2 
1 0 0 2;,=] 1 
0 0 1 3 3 


from which we can recover the 2 x 3 array. 
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Theorem 48 /f f: 7 — T is a permutation then 


Furthermore, the inverse of the matrix of the permutation is the matrix of 
the inverse of the permutation: 

(b) Pf)" = PUP), 

and the matrix of the product is the product of the matrices: 


(c) P(fg) = P(f)P(q). 


proof: If @ is the column vector with entries v,,v2,...,U, (the v; are 
arbitrary real numbers) then P(f)@ is the column vector whose 7” 
is equal to v; if f sends 2 to 7, by definition of P(f). Since, in this case, 
j = f(t) (here we write f(2) to denote the image of 7 under the permutation 
f,’ even though 7 really gets plugged into f on the left), this implies that 
P(f)v is the column vector with entries vs(1), Vs(2), «5 Vf(n). This proves (a). 

Note (b) is a consequence of (c) so we need only prove (c). We compute 
P(fg)v and P(f)P(g)%. By the same reasoning as in (a), we find that the 
i” coordinate of P( fg)? is vggy). Similarly, the 7” coordinate of P(g)@ is 
vi = vg). Therefore, the 7" coordinate of P(f)(P(g)®) is v4) = vg(f(0) = 
vg). This implies P( fg) = P(f)P(g)¥. Since the v; were arbitrary real 
numbers, this implies the theorem. 


coordinate 


Example 49 Let 


1 23 1 23 1 23 
r=(, I ) i=(; 2 ): b=(, 3 ): 


sof=f',g=g',h=fg. Moreover, 


001 010 
Pgj={010), Pih)={ 001 
100 100 


A direct matrix calculation verifies that P(f)P(g) = P(fg) = P(h) and 
Ph!) = P(g!f-!) = P(g!\P(F!) = Plg)P(f), as predicted by the 


above theorem. 
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The matrix can be determined from the graph of the function f : 7’ — T 
as follows: in the n x n grid of integral points (x,y), with x and y integers 
between 1 and n inclusive, fill in all the plotted points with 1’s and all the 
unplotted points with 0's The resulting n x n array is the matrix P(f). 

Rubik’s cubers will often, without knowing it perhaps, use the following 
lemma to solve their cube: 


Lemma 50 Let r € S, denote any permutation and let 1,3 denote distinct 
integers belonging to {1,2,...,n}. Let s denote the permutation sending i to 


j: 
(i)s = 7. 


Then s’ =r~'sr is the permutation sending (i)r to (j)r: 


More specifically (and this is the specific case which this lemma is most 
often applied): let t1,t2,...,%, denote distinct integers belonging to {1,2,...,n}. 
Let s denote the permutation sending t; to tj44: 


8(tj) = tj41, l<j<k, s(t) = %4, s(m) =m, Vm € {t1,.-,%e $- 
Then s" =r~'sr is the permutation sending (i;)r to (t;41)r: 


s'((a)r) = yaar, l<j<k, 8"((tz)r) = (a1 )r, s'(m) =m, Vm € {(ta)r,.., (te) 


In other words, if you have a move s which is a 3-cycle on 3 particular 
edges, say 


uf -> ult urt— uf, 


and another move r which sends these edges somewhere else, say r = F? 
so that r: uf -> df but leaves the other edges alone, then r~'sr is the 
permutation 


df -— ul +> ur += df. 


Try it! 
A proof of this lemma will be given in chapter 8. 
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3.2. Cycle notation 


The most common notation for a permutation is probably the ”cycle nota- 
tion”. The symbol 


(a, dg... Gp) (some r less than or equal to n) 
denotes the permutation f of 7’ which is defined by 
F(a) = 42, f(a) = 43, wy f(a) = a1, 


and f(7) = 2, if 7 is not equal to one of the ay,...,a,. Such a permutation is 
called cyclic. The number r is called the length of the cycle. 

We call two such cycles (a, a2... a,) and (b; 2 ... b;) disjoint if the sets 
{a1, d2,...,a,} and {by, by, ...,6:} are disjoint. 


Lemma 51 /f f and g are disjoint cyclic permutations of T then fg = gf. 


proof: This is clear since the permutations f and g of T affect disjoint 
collections of integers, so the permutations may be performed in either order. 


Lemma 52 The cyclic permutation (a, dz ... d-) has order r. 


proof: Note f(a) = a2, f?(a1) = a3, .., f(a) = @, f"(a1) = a1, by 
definition of f. Likewise, for any i = 1,...,r, we have f"(a;) = aj. 


Theorem 53 Every permutation f : {1,2,...,n} — {1,2,...,n} is the prod- 
uct of disjoint cyclic permutations. More precisely, if f is a permutation of 
{1,2,...,n} (with n > 1) then there are non-empty disjoint subsets of distinct 
integers 

Sy = {a11,---,d17,} C {1,2,...,n}, 

So = {aa1,-..,dar.} C {1,2,...,n}, 

_ 
Sh = {ag1, seey ak.r, } 

such that 


{1,2,..,.n}=S,U..US,, n=rtrot..+re, 


and 


f = (11, vey U1 py) (Agr, vey Uhr) 
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This product is called a cycle decomposition of f . If we rearrange the 
cardinalities r; of these sets $; in decreasing order, say we write this as 


ror See Sh, 
then the k-tuple (r},...,77,) is called the cycle structure of f and f is called 
a (rj,--,1,)-eycele. For example, (1,2)(3, 4,5) is a (3, 2)-cycle. 

proof: The proof is constructive. 

Let f: 7 — T bea permutation. List all the elements 


{C10 =leu= F(A), e12 = f?(1), e13 = PP), } 


(which must, of course, be finite in number but might also only contain the 
single element cig = 1). This is called the ”orbit of 1 under f”. Now list the 
elements in the ’orbit of 2”: 


{€20 = 2,¢1 = F(2), €22 = f?(2), c23 = f?(2), vfs 


and so on until we get to the “orbit of n”: 


{€no = 2,Cn1 = f(r), €n2 = f?(n), ns = f*(n), a}. 


If you pick any two of these n sets, they will either be the same (up to 
ordering) or disjoint. Denote all the distinct orbits which contain at least 
two elements by O4,...,O,. (It doesn’t matter what order you write them in 
or in what order you write the elements in each individual orbit.) Suppose 
that 

O71 = A115 ey A1yry sO |O;| =P, 

Og = d21,-4-,42%, 80 |O2| = 12, 


O; = Gk1, 5 Ok ry, so |O;| =P. 


In this case, ry + rg +... trp <n, with equality if and only if none of the 
orbits is a singleton. The cycle notation of f is the expression 


(411412..-01,71)---(@k1 Ak? kr, )+ 
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Example 54 e The cycle notation for 
(: 2 ;) 
2 1 3 
is (1 2). 


e The cycle notation for 


is (1 23). 


e The cycle notation for 


is (1 3)(2 4) =(2 4)(1 3). 


e The cycle notation for 


is (1 3)(2 45) = (45 2)(1 3). 
e The disjoint cycle decomposition of (2,3,7)(3, 7,10) ts (2,3)(7, 10). 


Exercise 3.2.1 Divide a square into 4 subsquares (”facets”) and label them 
1,2,3,4. For example, 
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Let r denote counterclockwise rotation by 90 degrees. Then, as a per- 
mutation on the facets, r = (1 3 4 2). Let f, denote reflection about the 
horizonal line dividing the square in two, let f, denote reflection about the 
vertical line dividing the square in two. Use the cycle notation to determine 
the permutations of the facets 

(a) r? 

(c) te, 

(d) fy 

(e) te * Tk fe, 

(1) fe fy. 


Exercise 3.2.2 Label the 24 facets of the 2 x 2 Rubik’s cube as follows: 


3.2. CYCLE NOTATION AT 


tooo o-oo + 

| 1 2 | 

| u | 

| 3 4 | 
tooo cocoon to------- oo tooo cocoon t---------- + 
| 5 6 | 9 10 | 13 14 | 17 18 | 
| 1 | f | r | b | 
| 7 8 | 11 12 | 15 16 | 19 20 | 
tooo o-oo to------- oo tooo cocoon t---------- + 

| 21 22 | 

| d | 

| 23 24 | 

tooo o-oo + 


(You may want to xerox this page then cut this cube out and tape it to- 
gether for this exercise.) Let X denote rotation clockwise by 90 degrees of 
the face labeled x, where x € {r,l, f,b,u,d} (so, for example, if x = f then 
X =F), Use the cycle notation to determine the permutations of the facets 
given by 

(a) R, 

(b) L, 

(c) FP, 

(d) B, 

(e) U, 

(f) D. 


Lemma 55 A cyclic permutation is even if and only if the length of its cycle 
is odd. A general permutation f :T — T is odd if and only if the number of 
cycles of even length in its cycle decomposition is odd. 
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We shall not prove this here. (For a proof, see Theorem 3.3 in Gaglione 
[G], §3.2.) 


Lemma 56 The order of a permutation is the least common multiple (lem) 
of the lengths r1,1r2,...,7, of the disjoint cycles in its cycle decomposition. 


Example 57 The order of (1 3)(2 4) is 2. Itis even. The order of (1 3)(24 5) 
is 6. It is odd. 


3.3. An algorithm to list all the permutations 


In Martin Gardner’s Scientific American article [Garl] an algorithm is men- 
tioned which lists all the permutations of {1,2,...,n}. This algorithm, due 
originally to the mathematician Hugo Steinhaus, gives the fastest known 
method of listing all permutations of {1, 2,...,n}. 

We shall denote each permutation by the second row in its 2 x n array 
notation. 

For example, in the case n = 2: 


NO oF 
r bo 


are the permutations. 
To see the case n = 3, the idea is to 
(a) write down each row n = 3 times each as follows: 


2 


Now NM Re RR 
me rR Fr PS Wb 


(b) *weave” in a 3 as follows 


ewWMrF Ww bd 
Wr rR bh bh ww 


NOM wwr, rR 
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In case n = 4, the idea is to 


(a) write down each row n = 4 times each as follows: 


ror er SE kr EWR ww NON ON NMR RP rR Fr we wwe wnMmnh dS we 
WWW Wr Fr eR rE rE ENO NNMNNNMNMNWN WWW Ww 


NPNMwmONMwMWNwWMNMWMNW Ww wWwWwwowiw wr FF Fr FR rR FR - 
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(b) now ” weave” a 4 in: 


WNNFENMONNWWWK BR WWW Pe BR BRE EEE 
PrPeNWNWRWWNHNHNDRWWERP REP WW RR BD DD 
WORE rPwWwWRE PBR DNYDHEPEP KR DYNDAWwWWwWNHNABAW 
BRwWWwWWe PPAR RE FP ENWNMNNBKNYNYNWWWE 


In general, we have the following 


Theorem 58 (Steinhaus) There is a sequence of (not necessarily distinct) 
2-cycles, (a1, 61),...,(an,6n), where N = n!—1, such that each non-trivial 
permutation f of {1,2,...,n} may be expressed in the form 


k 


f = [[ (a. 60). 


i=l 


for some k, 1<k< N. Furthermore, these products (for k = 1,2,...,N) are 
all distinct. 
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In other words, each permutation may be written as a product of (not 
necessarily disjoint) 2-cycles. This will be proven in section 5.11 on Cam- 
panology below. 

There is an analogous result valid only for even permutations: each even 
permutation may be written as a product of (not necessarily disjoint) 3- 
cycles. This will be stated more precisely (and proved) later - see Proposition 


159. 


52 


CHAPTER 3. PERMUTATIONS 


Chapter 4 


Permutation Puzzles 


”How can it be that mathematics, being after all a product 
of human thought independent of experience, is so admirably 
adapted to the objects of reality?” 

Albert Einstein 


”'Though this be madness, yet there is method in’t.” 
Shakespeare 


We shall describe several permutation puzzles in this chapter. 
A one person game is a sequence of moves following certain rules satisfy- 
ing 


e there are finitely many moves at each stage, 

e there is a finite sequence of moves which yields a solution, 
e there are no chance or random moves, 

e there is complete information about each move, 


e each move depends only on the present position, not on the existence or 
non-existence of a certain previous move (such as chess, where castling 
is made illegal if the king has been moved previously). 


A permutation puzzle is a one person game (solitaire) with the following 
five properties listed below. Before listing the properties, we define the ” puz- 
zle position” to be the set of all possible legal moves. The five properties of 
a permutation puzzle are: 
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1. for some n > 1 depending only on the puzzle’s construction, each move 
of the puzzle corresponds to a unique permutation of the numbers in 


T = {1,2,...,n}, 


2. if the permutation of T in (1) corresponds to more than one puzzle 
move then the the two positions reached by those two respective moves 
must be indistinguishable, 


3. each move, say M, must be “invertible” in the sense that there must 
exist another move, say M~!, which restores the puzzle to the position 
it was at before M was performed, 


4. if M, is a move corresponding to a permutation f;, of T and if My isa 
move corresponding to a permutation fy of T then M,* M, (the move 
M, followed by the move M2) is either 


e not a legal move, or 


e corresponds to the permutation f, * fo. 


Notation: As in step 4 above, we shall always write successive puzzle 
moves left-to-right. 


4.1 15 puzzle 


One of the earliest and most popular permutation puzzles is the ”15 puzzle”. 
The ”solved position” looks like (see Fig 4.1) 

These numbered squared represent sliding blocks which can only move 
into the blank square. We shall sometimes label the blank square as ”16” for 
convenience. The moves of the puzzle consist of sliding numbered squares 
(such as 12, for example) into the blank square (e.g., swapping 12 with 16). 
In this way, each move of this puzzle may be regarded as a permutation of 
the integers in {1,2,..., 16}. 


Exercise 4.1.1 Check that the five conditions of a permutation puzzle are 
satisfied by the 15 puzzle. 


Not every permutation of the {1,2,...,16} corresponds to a possible po- 
sition of the puzzle. For example, the position (see Fig 4.2) 
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The 15 puzzle 


Figure 4.1: The 15 Puzzle 


The 14-15 puzzle 


Figure 4.2: The 14-15 Puzzle 
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cannot be attained from the previous position. (The mathematical reason for 
this is explained in 6.3 below, for example.) Apparently, the puzzle inventor 
Sam Loyd applied for a U.S. patent for the above puzzle (the one with the 14, 
15 swapped) but since it could not be ”solved” - i.e., put in the correct order 
1,2,...,15 - no working model could be supplied, so his patent was denied. 
(This is in spite of the fact that there were apparently thousands of them on 
the market already.) 

The moves of the 15 puzzle may be denoted as follows: suppose we are 
in a position such as (see Fig. 4.3) 


U,D,L,R denote any of the 
R numbers 1,2,...,15 


The 15 puzzle 


Figure 4.3: The 15 Puzzle 


The possible moves are 


R= Rurai = (r 16) = swap r and 16, 
DL = Lua. = (1 16) = swap | and 16, 
U = Uura = (u 16) = swap u and 16, 
D =D.» a = (d 16) = swap d and 16. 


Exercise 4.1.2 Verify that the five defining properties of a permutation puz- 
zle are satisfied by this example. 


We shall call the 15 puzzle a planar puzzle since all its pieces lie on a flat 
board. 
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4.2 Devil’s circles (or Hungarian rings) 


This is a planar puzzle consisting of two or more interwoven ovals, each of 
which has several labeled (by colors or numbers) pieces, some of which may 
belong to more than one oval. A puzzle move consists of shifting an oval by 
one or more “increments”, and hence all the pieces on it, along the oval’s 
grooved track. The pieces are equally spaced apart (in spite of the typed 
depiction below) and those pieces which lie on more than one oval can be 
moved along either oval. 

For simplicity, consider the puzzle consisting of only two ovals, each hav- 
ing 6 pieces: (see Fig 4.4) 


Hungarian rings 


Figure 4.4: Hungarian rings 


The pieces 1 and 3 can be moved along either oval. Note that each move 
corresponds to a unique permutation of the numbers in {1,2,...,10}. For 
example, rotating the right-hand oval clockwise one increment corresponds 
to the permutation 

123 45 678 9 10 
612 3 45 7 8 9 10 


which we may write in cycle notation as (6 5 4 3 2 1). 
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Exercise 4.2.1 Verify that the five defining properties of a permutation puz- 
zle are satisfied by this example. 


4.3 Equator puzzle 


This puzzle is in the shape of a sphere but has 3 circular bands encircling 
a sphere, each having 12 square-shapped pieces and each band intersecting 
each other at a 90 degree angle. Each pair of circles intersects at two points, 
or “nodes”, and at each such node there is a puzzle piece shared by the two 
circular bands. There are 6 nodes total. The total number of movable pieces 
is therefore 3-12 —6 = 30. 

On some puzzles the sphere is painted a map of the earth, others have 
colored puzzle pieces. The rough idea, minus any colors, is depicted in the 
following picture: (see Fig 4.5) 


Equator puzzle 


Figure 4.5: Solved masterball (front) 


For concreteness, suppose we are looking at a globe equator puzzle. The 
longitudinal band circles the equator, one latitudinal band passes through 
North America and the other latitudinal band passes through Europe and 
Africa. A move of the puzzle consists of rotating one of the bands (along 
with all the pieces it contains) in either direction. Successive puzzle moves 
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may change the “orientation” of a piece, as we will see later. When the 30 
pieces are such that the puzzle is a correct map of the Earth then we call the 
position ”solved”. 

For ease of drawing, let us redraw the globe of the Earth using the ” Mer- 
catur projection”, i.e., as a wall map: 


1 1 1 1 
2 13 12 22 
3 14 11 21 
4 23 24 15 25 26 10 27 28 20 29 30 
5 16 9 19 
6 17 8 18 
7 7 7 7 


The "solved" position 


Sometimes we shall also denote 1 by NP ("north pole”) and 7 by SP 
("south pole” ). 

This description is a little misleading due to the fact that it tells us where 
a piece is but not, for example, whether it is upside down or not. We shall 
ignore this problem for now and simply describe how a move affects the 
position of a piece. We see from the above labeling that any move of the 
Equator puzzle corresponds to a unique permutation of the integers in 1, 2, 
..., 30. For example, the move which rotates the equator east-to-west by 30 
degrees corresponds to the permutation 


(4 30 29 20 28 27 10 26 25 15 24 23). 


Now we shall show to assign an orientation to a piece. We shall regard an 
orientation (which is, after all, simply an indication of what angle the piece 
is twisted”) as an integer 0, 1 ,2, or 3. First, if a piece is not in its correct 
position, it gets an orientation of 0. If a piece is in its correct position then 
it gets a 0, 1, 2, or 3, depending on its angle from the correct angle (i.e., the 
angle the piece has in the ”solved” position): 
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Example 59 A piece which has been rotated by 90 degrees counterclockwise 
from its correct orientation gets an orientation of 8. 


In general, the labels for the pieces of the Equator puzzle should be 
choosen from the set given by the Cartesian product of the set of integers 
used to label the positions, {1,2,...,30}, by the set of integers used to label 
the orientations: 


S = {1,2,...,30} x {0,1,2,3} = {(m,n) | 1 <m <30,0 <n <3}. 


Each move of the Equator corresponds to a unique permutation of the 
set S. There are 120 elements of $,c all them 


S = {$1, $2, v3 $120 }- 


If we identify the set S with the set T = {1,...,120} then we move of the 
Equator corresponds to a unique permutation of the set 7’. 


Exercise 4.3.1 Verify that the Equator puzzle satisfies the five defining prop- 
erties of a permutation puzzle. 


Question: Can you show the following: If a piece is correctly oriented 
then its antipodal piece is also correctly oriented? 

Notation: We introduce notation for 3 basic moves of the Equator puzzle 
which generate all possible puxzzle moves. Let us label the three circular 
bands on the globe as Cl, C2, and C3. Let Cl be the band which, in the 
solved position, contains the pieces labeled 1, 2, ..., 12; let C2 be the band 
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which, in the solved position, contains the pieces labeled 7, 13, ..., 22; and 
let C3 be the band which, in the solved position, contains the pieces on the 
equator. 

Let r; be the puzzle move associated to the rotation of Cl given by 


9 10 11 12 


12 3 4 7 8 
23 4 5 8 9 10 11 12 1 


Let rz be the puzzle move associated to the rotation of C2 given by 


1 13 14 15 16 17 17 18 19 20 21 22 
13 14 15 16 17 17 18 19 20 21 22 1 


Let r3 be the puzzle move associated to the rotation of C3 given by 


4 23 24 15 25 26 10 27 28 20 29 30 
30 4 23 24 15 25 26 10 27 28 20 29 


It is clear after a little thought that each of these moves corresponds to a 
permutation of the 120 position/orientation labels of the pieces of the equator 
puzzle. Furthermore, such a permutation determines the puzzle position 
uniquely since it specifies the facet’s position and orientation. 


Exercise 4.3.2 Verify that the remaining properties of a permutation puzzle 
are satisfied. 


4.4 Rainbow Masterball 


Some rules for the rainbow masterball (referred to simply as ”masterball” 
in the following): A masterball sphere has 32 tiles of 8 distinct colors. We 
shall assume that the masterball is in a fixed position in space, centered 
at the origin. A geodesic path from the north pole to the south pole is 
called a longitudinal line and a closed geodesic path parallel to the equator 
is called a latitudinal line. There are 8 longitudinal lines and 3 latitudinal 


lines. In spherical coordinates, the longitudinal lines are at the angles which 
are multiples of 7/4 (Le., at 6 = nz/4, n =1,..,8) and the latitudinal lines 
are at @ = 7/4,7/2,30/4. (Here m = 3.141592... as usual.) (see Fig 4.6) 
The sphere shall be oriented by the right-hand rule - the thumb of the 
right hand wrapping along the polar axis points towards the north pole. We 
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Figure 4.6: 


assume that one of the longitudinal lines has been fixed once and for all. This 
fixed line shall be labeled ”1”, the next line (with respect to the orientation 
above) as ”2”, and so on. 

Allowed moves: One may rotate the masterball east-to-west by multiples 
of x/4 along each of the 4 latitudinal bands or by multiples of x along each 
of the 8 longitudinal lines. 

A facet. will be one of the 32 subdivisions of the masterball created by 
these geodesics. A facet shall be regarded as immobile positions on the sphere 
and labeled either by an integer? € {1,...,32} or by a pair (2,7) € [1,4] x[1, 8], 
whichever is more convenient at the time. If a facet has either the north 
pole or the south pole as a vertex then we call it a small (or polar) facet. 
Otherwise, we call a facet large (or middle or equatorial). A coloring of the 
masterball will be a labeling of each facet by one of the 8 colors in such a 


way that 

(a) each of the 8 colors occurs exactly twice in the set of the 16 small 
facets, 

(b) each of the 8 colors occurs exactly twice in the set of the 16 large 
facets. 

A move of the masterball will be a change in the coloring of the masterball 
associated to a sequence of manuevers as described above. 
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If we now identify each of the 8 colors with an integer in {1,...,8} and 
identify the collection of facets of the masterball with a 4x8 array of integers 
in this range. To solve an array one must, by an appropriate sequence of 
moves corresponding to the above described rotations of the masterball, put 
this array into a ”rainbow” position so that the matrix entries of each column 
has the same number. Thus the array 


123 45678 
123 45678 
123 45678 
123 45678 
is ’solved”. The array 
6 78 123 4 5 
123 45678 
123 45678 
123 45678 


corresponds to a rotation of the north pole facets by 37/4. 

Notation We use matrix notation to denote the 32 facets of the mas- 
terball. The generators for the latitudinal rotations are denoted rj, 12,173, 14. 
For example, r, sends 


ll 12 13 14 15 16 17 18 
21 22 23 24 25 26 27 28 
31 32 33 34 35 36 37 38 
41 42 43 44 45 46 47 48 


to 
12 13 14 15 16 17 #18 11 
21 22 23 24 25 26 27 28 
31 32 33 34 35 36 37 38 
41 42 43 44 45 46 47 48 


which is pictured as: (see Fig 4.7) 

As you look down at the ball from the north pole, this move rotates the 
ball clockwise. The other moves r2,73,7r4 rotate the associated band of the 
ball in the same direction - clockwise as viewed from the north pole. 
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rl (rotates top band 
clockwise) 


Figure 4.7: 


The generators for the longitudinal rotations are denoted /,, fo,..., fg. For 
example, f; sends 


11 12 13 14 15 16 17 18 
21 22 23 24 25 26 27 28 
31 32 33 34 35 36 37 38 
41 42 43 44 45 46 47 48 


to 
44 43 42 41 16 17 18 11 
34 33 32 31 25 26 27 28 
24 23 22 21 35 36 37 38 
15 14 13 12 45 46 47 48 


which is pictured as: (see Fig. 4.8) 
With these rules, one can check the relation 


fori erg trg erg * fart erg #15 * TK. 
Exercise 4.4.1 Find similar identities for fo, fr, fs. 


Also, one can check that 


ry = (fs * fr)" «7 & fg fr. 
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f1 (rotates by 180 
degrees along the first 


ae 


Figure 4.8: 


Exercise 4.4.2 There are similar identities for ro,1r3,r4. Find them. 


Identify the facets of the masterball with the entries of the array 


8 * 6 5 4 3 2 #1 
16 15 14 13 12 11 10 9 
24, 28) 22 20. 200 A 8. Le 
32 31 30 29 28 27 26 25 


(there is a reason for labeling the facets ” backwards” like this but it’s not im- 
portant). We may express the generators of the masterball group in disjoint 
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cycle notation as a subgroup of S32 (the symmetric group on 32 letters): 


7’ = (12345678), 


= (171819 2021 2223 24), 

= (25 26 27 28 29 3031 32), 
fi = (5 32)(6 sits 0)(8 29)(13 24)(1423)(15 22)(1621), 
= (431)(530)(6 29)(7 28)(12 23)(13 22)(14 21)(15 20), 
fs = (3 30)(4 29)(5 28) (6 27)(11 22)(12 21)(13 20)(14 19), 
4 = (2 29)(3 28)(4 27) (5 26)(10 21)(11 22)(12 23)(13 24), 
fs = (1 28)(2 27)(3 26) (4 25)(9 20)(10 19)(11 18)(12 17), 
fs = (8 27)(1 26)(2 25) (3 32)(16 19)(9 18)(1017)(11 24), 
= (7 26)(8 25)(1 32)(231)(15 18)(16 17)(9 24)(10 23), 
= (6 25)(7 32)(831)(130)(14 17)(15 24)(16 23)(9 22), 


Exercise 4.4.3 Verify that the properties of a permutation puzzle are satis- 


fied for this puzzle. 


More information on this puzzle will be given in a later chapter. 


4.5 Rubik’s cubes 


We shall introduce the 2 x 2, 3 x 3,4 x 4, and higher Rubik’s cubes. 


4.5.1 2x2 Rubik’s cube 


The ” pocket” Rubik’s cube has 6 sides, or ”faces”, each of which has 2-2 = 4 
"facets”, for a total of 24 facets: (see Fig 4.9) 

Fix an orientation of the Rubik’s cube in space. Therefore, we may label 
the 6 sides as f, b, l, r, u, d, as in the picture. It has 8 subcubes. Each 
face of the cube is associated to a ”slice” of 4 subcubes which share a facet 
with the face. The face, along with all of the 4 cubes in the ”slice”, can 
be rotated by 90 degrees clockwise. We denote this move by the upper case 
letter associated to the lower case letter denoting the face. For example, F 
denotes the move which rotates the front face by 90 degrees to clockwise. 
(see Fig 4.10) 

As in chapter 4, we label the 24 facets of the 2 x 2 Rubik’s cube as follows: 
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6 sides: (F) front, B (back), 
L (left), R (right), U (up), D (down) 


Figure 4.9: 


The move F 


Figure 4.10: The move F 
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toon nnn + 

| 1 2 | 

| u | 

| 3 4 | 
tonne tonne nen nnn--- toon nnn tonne one nn---- + 
| 5 6 | 9 10 | 13 14 | 17 138 | 
| 1 | f | r | | 
| 7 8 | 11 12 | 15 16 | 19 20 | 
tonne tonne nen nnn--- toon nnn tonne one nn---- + 

| 21 22 | 

| d | 

| 23 24 | 

toon nnn + 


The 24 facets will be denoted by xyz where x is the face on which the 


facet lives and y, z (or z, y - it doesn’t matter) indicate the 2 edges of the 


facet. Written in clockwise order: 


front face: 
back face: 
right face: 
left face: 
up face: 
down face: 


Exercise 4.5.1 Verify that the properties of a permutation puzzle are satis- 


fied for this puzzle. 


For future reference, we call this system of notation (which we will also 
use for the 3 x 3 and 4 x 4 Rubik’s cube) the Singmaster notation. 


fru, 
blu, 
rbu, 
lfu, 
urb, 
drf, 


frd, 
bld, 
rbd, 
lfd, 
urf, 
drb, 


fld, 
brd, 
rfd, 
lbd, 
ulf, 
dlb, 


flu 
bru 
rfu 
lbu 
ulb 
d1f 
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4.5.2 3x3 Rubik’s cube 


In this section we shall, for the most part, simply introduce enough notation 
(due to Singmaster [Si]) to allow us to check that the puzzle is in fact a 
permutation puzzle. We shall also introduce a two-person game which is 
easier to play and learn than solving the cube. 

The Rubik’s cube has 6 sides, or ”faces”, each of which has 3-3 = 9 
*facets”, for a total of 54 facets. We label these facets 1,2,...,54 as follows: 


tonne + 
| 1 2 3 | 
| 4 u 5 | 
| 6 7 8 | 
tonne tonne nen nnn--- toon nnn tonne one nn---- + 


tonne tonne nen nnn--- toon nnn tonne one nn---- + 
| 41 42 43 | 
| 44 d 45 | 
| 46 47 48 | 
tonne + 


then the generators, corresponding to the six faces of the cube, may be 
written in disjoint cycle notation as: 


F = (17 19 2422) (18 21 23 20)(6 25 43 16)(7 28 42 13)(8 304111), 
B = (333540 38)(34 37 39 36)(3 9 46 32)(2 1247 29)(1 14 4827), 
L = (91116 14)(10 13 15 12)(1 17 41 40)(4 20 44 37) (6 22 4635), 
R= (252732 30) (26 29 31 28)(3 38 43 19)(5 36 45 21)(8 33 48 24), 
U = (1386)(2574)(9 33 25 17)(10 34 26 18)(11 35 27 19), 
D = (41.43.48 46)(42 45 47 44)(14 22 30 38)(15 23 31 39)(16 24 32.40). 
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Exercise 4.5.2 Check this. (It is helpful to xerox the above diagram, cut it 
out and tape together a paper cube for this exercise.) 


The notation for the facets will be similar to the notation used for the 
2 x 2 Rubik’s cube. The corner factes will have the same notation and the 
edge facets will bve denoted by xy, where x is the face the facet lives on and 
y is the face the facet borders to. In clockwise order, starting with the upper 
right-hand corner of each face: 


front face: fru, fr, frd, fd, fld, fl, flu, fu 
back face: blu, bl, bld, bd, brd, br, bru, bu 
right face: rbu, rb, rbd, rd, rfd, rf, rfu, ru 
left face: lfu, 1f, lfd, ld, lbd, lb, lbu. lu 
up face: urb, ur, urf, uf, ulf, ul, ulb, ub 
down face: drf, dr, drb, db, dlb, dl, dlf, df 


Exercise 4.5.3 Verify that the properties of a permutation puzzle are satis- 
fied for this puzzle. 
First fundamental theorem of cube theory 


We shall frequently need the following fact: 


Theorem 60 Beginning with a solved cube, label the following facets with 
an invisible “+” (i.e., mark the spatial position of the facet on the cube with 
a 74"): 


e U facet of the uf edge subcube 
e U facet of the ur edge subcube 
e F facet of the fr edge subcube 
e all facets which can be obtained from these by a move of the slice group. 


Label the U and D facets of each corner subcube with an invisible ”+”. These 
”4” signs are called the standard reference markings. Each move g of the Ru- 


bik’s cube yields a new collection of ”+” labels, called the markings relative to g. 
A position of the Rubik’s cube is determined by the following decision process: 
(a) How are the edge subcubes permuted? 
(b) How are the center subcubes permuted? 
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(c) How are the corner subcubes permuted? 

(d) Which of the relative edge markings are flipped (relative to the stan- 
dard reference markings)? 

(e) Which of the relative corner markings are rotated from the standard 
reference markings and, if so, by how much (2x/3 or 4x/3 radians clockwise, 
relative to the standard reference markings)? 


This is labeled as a theorem because of its relative importance for us, not 
because of it’s difficulty! This is the ” First Fundamental Theorem of Rubik’s 
cube theory”. 

As an exercise, the reader should convince him or herself that this theorem 
is correct. 


The superflip game 


The position of the Rubik’s cube where every edge is flipped, but all the 
others subcubes are unaffected, is called the superflip position. 


To play the game, first choose two particular faces as your up (U) and 
front (F) face - say white is up and red is front (assuming you have a cube 
with adjacent white and red faces). Imagine the cube being placed in space 
with rectangular coordinate axes in such a way that the bdl corner is at the 
origin (0,0,0), the dl edge is along the x-axis and the 6/ edge is along the 
Z-axis. 

The rules ("slice-superflip game” ): 


1. Players alternate making moves starting with the cube in the solved 
position. The first player is determined by (say) a coin toss. 


2. A move consists of flipping over exactly two edges. Both edges must 
lie in a slice. The edge closest to the origin (or, if this is a tie, closest 
to the x-axis) must be flipped from solved” to wrong”. 


3. The first player to reach the superflip position wins. 


Of course, to play this game you must know several edge-flipping moves, 
such as those in §14.2.1 below. 

This game is related to a game which might be called a three dimensional 
acrostic twins game”. (See [BCG], vol II, page 441, for a two dimensional 
acrostic twins game.) 

Several alternate versions of this game may also be played. 
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”Nonslice-superflip game”: The rules are the same except the condition 
that the two edges belong to the same slice is either dropped altogether or 
replaced by the condition that the two edges do not belong to the same slice. 

”Mobius-superflip game”: The rules for this version are the same except 
the condition that two edges are flipped is to be replaced by any number 
of edges less than 6 (i.e., exactly 2, 4, or 6) is to be flipped. This game is 
related to a game which might be called a ”three dimensional Mobius”. (See 


[BCG], vol II, page 434, for a Mobius game.) 


Exercise 4.5.4 Play a game! 


4.5.3. 4x4 Rubik’s cube 
The 4 x 4 Rubik’s cube has 6 sides, or ”faces”, each of which has 4-4 = 16 


*facets”, for a total of 96 facets. As usual, we fix an orientation of the cube 
in space, so we may pick a front face, back face, ..... We label these facets 
1,2,...,96 as follows: 
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The reader may want to xerox the above diagram, cut it out and tape 
together a paper cube. 

Notation: We need notation for the facets and for the moves. 
Facets: To label the facets, we must pick an orientation of each face, say 
clockwise. For example, the the front face may be labeled as 
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The labeling of the other faces is similar. 


Exercise 4.5.5 Label the other 5 faces. 


Moves: Parallel to each face x are 4 slices of 16 subcubes each labeled 
X1,X2,X3, X4, in order of their distance from the face. For example, the 
front face f has 16 subcubes comprising the F, slice, the two inner slices are 
Fy, Fs, and the last slice Fy is actually the same as the first slice B, associated 
to the back face. 
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The 12 generators (written in disjoint cycle notation), corresponding 2 
each to the six faces of the cube are given by: 


U, = (49 52 88 85)(62 63 75 74)(50 64 87 73) x 
(51 76 86 61)(5 153 9)(6 254 10)(7 355 11)(8 456 12), 
Uy = (171365 21)(18 14 66 22)(19 15 67 23)(20 16 68 24), 
L, = (964849 1)(84 36 61 13)(72 24 73 25)(60 12 8537) x 
x (89 53 56 92)(90 65 55 80)(91 77 54 68)(66 67 79 78), 
Ly = (5911 86 38)(71 23 74 26)(83 35 62 14)(95 4750 2), 
F, = (89593 92)(77 17 81 80)(65 29 69 68)(53 41 57 86) x 
(1440 37)(2 16 39 25)(3 28 38 13)(14 15 27 26), 
Fy = (73681 91)(74 18 82 79)(75 30 83 67)(76 42 8455), 
R, = (4088 957)(28 76 21 69)(16 64 3381) (45249 93) x 
(41 58 44)(42 17 732) (43 29 6 20)(18 19 31 30), 
Ry = (3987 10 58)(27 75 22 70)(15 6334 82)(351 4694), 
By = (5253 44 60)(51 65 3259)(50 77 20 58)(49 89 857) x 
(9 1248 45)(10 24.47 33)(11 36 46 21)(22 23 35 34), 
By = (547243 64)(66 71 31 63)(78 70 19 62)(90 69 761), 
Dy, = (57 60 96 93)(58 72 95 81)(59 84 94 69) x 
(70 71 83 82)(45 89 37 41)(46 90 38 42)(47 91 39 43)(48 92 40 44), 
Dy = (33.77 25 29)(34 78 26 30)(35 79 27 31)(36 80 28 32). 


Exercise 4.5.6 Verify that the properties of a permutation puzzle are satis- 
fied for this puzzle. 


4.5.4 nxn Rubik’s cube 


Other than the 2 x 2, 3 x 3, and 4 x 4 cubes, the only other Rubik’s cube 
manufactured, as far as I know, is the 5 x 5 Rubik’s cube. Apparently, 
there are mechanical problems which cause the manufacture of the n x n 
cubes to be overly expensive or perhaps even impossible, for n large. For 
information, at least theoretically, on the solution of such cubes, the reader 
might be interested in the article [L] or the postings in the archives of the 
*cube-lovers” list [CL]. 


4.6 Skewb 


The skewb is a cube which has been subdivided into regions differently than 
the Rubik’s cube. First, fix an orientation of the cube in space, so we may 
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talk about a front face, a back face, up, down, left, and right. Each of these 
6 square faces are subdivided into 5 facets as follows: (see Fig. 4.11) 


The solved skewb [front view) 


Figure 4.11: The solved skewb (front) 


The 4 corner facets are labeled exactly as in the case of the Rubik’s cube 
(as the lower case xyz, where x is the label of the face the facet lives on, y 
and z the two neighboring faces). 

The skewb itself is a cube subdivided as follows: there are 8 corner pieces 
which are each in the shape of a tetrahedron. For example, if you hold a 
cube in front of you the upper right hand corner of the front face is the facet 
of a tetrahedron whose facets are labels fy, 174, te. 

The moves of the skewb are different from the Rubik’s cube as well: Label 
the corners as X Y Z, where xyz is the notation for any of the facets belonging 
to that corner piece. Pick a corner XY Z of the cube and draw a line L passing 
through that corner vertex and the opposite corner vertex (”"skewering the 
cube”). That line defines a 120 degree rotation in the clockwise direction 
(viewed from the line looking down onto the corner you picked). One move 
of the skewb is defined in terms of this rotation as follows: Of course a 120 
degree rotation of the entire cube about the line L will preserve the cube but 
swap some faces and some vertices. The skewb has a mechanism so that you 
can actually rotate half (a ”skewed” half) the skewb by 120 degrees about 
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L and leave the other half completely fixed. This rotation of half the skewb 
about L will also be denoted XY Z. 


We may also label the 5-6 = 30 facets as follows: 


tooo cocoon + 

| 20 17 | 

| 16 u | 

| 19 18 | 
tooo cocoon to------- oo tooo cocoon t---------- + 
| 5 2 | 10 7 | 25 22 | 30 27 | 
| 1 1 | 6 f | 21 xr | 26 b | 
| 4 3 | 9 8 | 24 23 | 29 28 | 
tooo o-oo to------- oo tooo cocoon t---------- + 

| 15 12 | 

| 141 da | 

| 14 13 | 

tooo o-oo + 


Example 61 Consider the rotation UFR associated to the corner ufr. This 
move permutes the facets of the skewb. As a permutation, the disjoint cycle 
notation for this move is 


UFR = (6 16 21)(7 18 25)(10 17 24)(8 19 22). 


Note, in particular UFR does not move the 9-facet. 
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The eight basic moves are given by 


FUR = (6 16 21)(7 18 25)(10 17 24)(8 19 22) 
RUB = (21 16 26)(22 17 30)(25 20 29)(23 18 27) 
BUL = (26 16 1)(27 20 5)(28 17 2)(30 19 4) 
LUF = (1 16 6)(2 19 10)(5 18 9)(3 20 7) 
FDR = (11 6 21)(25 13 9)(23 15 7)(24 12 8) 
BDR = (26 11 21)(29 13 23)(27 12 22)(30 14 24) 
FDL = (611 1)(9 15 3)(10 12 4)(8 14 2) 
LDB = (1 11 26)(3 13 27)(4 14 28)(5 15 29). 


Nee Ne 
nn 


All other moves are obtained by combining these moves sequentially. 
The reader who wishes may check these by xeroxing the above diagram, 
cutting it out and taping it together. 


Exercise 4.6.1 Verify that the properties of a permutation puzzle are satis- 
fied for this puzzle. 


4.7 Pyraminx 


The pyraminx is a puzzle in the shape of a tetrahedron. A tetrahedron is 
a 4-sided regular platonic solid, all of whose faces are equilateral triangles. 
Each of the 4 faces of the puzzle is divided into 9 triangular facets: (see Fig. 
4.12) 

There are a total of 4-9 = 36 facets on the pyraminx. They will be 
labeled as follows (the reader may want to xerox this, cut it out, then fold 
the corners and tape it into a tetrahedron): (see Fig 4.13) 

We fix an orientation of the tetrahedron in space so that you are looking 
at a face which we call the ”front”. We may also speak of a ”right”, ” left”, 
and ”down” face. We label the 4 faces as f(ront), r(ight), (eft), d(own). We 
label the vertices U(p), R(ight), L(eft), and B(ack). 

The tetrahedron itself has been subdivided into sub-tetrahedrons as fol- 
lows: to each vertex X (so X € {U,R,L, B}) there is an opposing face F of 
the solid. For each such face, we slice the solid along two planes parallel to 
the vertex X and lying in between the face and the vertex. We want these 
planes, along with the face and the vertex to be spaced apart equally. The 
sub-tetrahedrons in the slice of the face itself will be called the face slice as- 
sociated to the face fF’, denoted F,, the sub-tetrahedrons in the middle slice 
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pyraminx 


Figure 4.12: Pyraminx 


a anal a 
VV AAV EVAN 


Figure 4.13: 


ANE VEEN 
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parallel to the face F’ will be called the middle slice associated to that face, 
denoted fy, and the sub-tetrahedron containing the vertex X to the face tip 
associated to that vertex, denoted F3. 

To each face labeled F’, we have a clockwise rotation by 120 degrees of 
the first slice F, of the face. We shall denote this rotation also by F,. This 
rotation only moves the facets living on the slice F,. Similarly, we have a 
clockwise rotation by 120 degrees of the second slice Fy of the face. We 
shall denote this rotation also by Fy. Fs denotes the clockwise rotation by 
120 degrees of the opposing sub-tetrahedron containing the vertex X. These 
moves permute the labels for the 36 facets, hence may be regarded as a 
permutation of the numbers 1,2,...,36. 

For example, the clockwise rotation by 120 degrees (looking at the front 
face) of the sub-tetrahedron opposite to the front face will be denoted F3. 
The disjoint cycle notation for this move, regarded as a permutation, is 


F3 = (23 22 36). 
The basic moves are given as follows: 


F, = (2 32 27)(8 31 26)(7 30 12)(19 29 11)x 
«(18 28 3)(1 17 13)(6 15 4)(5 16 14) 
Fy = (9 35 25)(21 34 24)(20 33 10) 
Fs, = (23 22 36 


—~ 


R, = (3 36 17)(11 34 16)(10 35 6)x 
(24 31 5)(23 32 1)(2 22 18)(9 20 7)(8 21 19) 

Ry = (12 33 15)(26 29 14)(25 30 4) 

= (27 28 13) 
L,=(1 i‘ 22)(5 29 21)(4 33 9)x 
(14 34 8)(13 36 2)(3 27 23)(11 26 24)(12 25 10) 

L, =(6 30 20)(16 31 19)(15 35 7) 

= (17 32 18) 


Dy = (13 is 93)(14 19 24)(15 20 25)x 
«(16 21 26)(17 22 27)(28 32 36)(29 31 34)(30 35 33) 
Dy = (47 10)(5 8 11)(6 9 12) 
D3 = (1 23) 


All other moves are obtained by combining these moves sequentially. In- 
deed, later, we shall want to use moves of the form fF * F3, for each face 
F, but the disjoint cycle notation for these permutations are a little more 
cumbersome to write down. 


4.8. MEGAMINX sl 


Exercise 4.7.1 Verify that the properties of a permutation puzzle are satis- 
fied for this puzzle. 


4.8 Megaminx 


This puzzle is in the shape of a dodecahedron. A dodecahedron is a 12-sided 
regular platonic solid for which each of the 12 faces is a pentagon. We call 
two faces neighboring if they share an edge. There are 20 vertices and 30 
edges on a dodecahedron. 

Each of the puzzle faces has been subdivided into 11 facets by slicing each 
edge with a cut which is both parallel to that edge and not far from the edge 
(say one-fifth the way to the opposite vertex). A picture is as follows:(see 
Fig. 4.14) 


os ae a a 
i i eV— 
i a . b 
h k x c 
e |. d 
ae f P ie 
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Figure 4.14: Megaminx 


There are a total of 11-12 = 132 facets on the puzzle. Each face of the solid 
is parallel to a face on the opposite side. Fix a face of the dodecahedron and 
consider a plane parallel to that face slicing through the solid and about one- 
fifth the way to the opposite face. There are 12 such slices. Two such slices 
associated to two neighboring edges will intersect inside the dodecahedron 
at a 120 degree angle but two such slices associated to two non-neighboring 
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edges will not intersect inside the dodecahedron (though they will intersect 
outside the solid of course). We slice up the solid dodecahedron in this way. 
This creates a smaller dodecahedron in the center and several other irregular 
smaller pieces. 

For each such slice associated to a given face f; there is a basic move 
still denoted f; of the megaminx given by clockwise rotating the slice of the 
megaminx by 120 degree, leaving the rest of the dodecahedron invariant. 
Such a move effects 26 facets of the megaminx and leaves the remaining 106 
facets completely fixed. 

Label the 12 faces of the solid as fi, fo,..., fig in some fixed way. Imagine 
that the dodecahedron is placed in 3-space in such a way that one side on the 
xy-plane and is centered along the positive z-axis so that one of the vertices 
of the top face is at the xyz-coordinate (r,0,s), where r is the radius of the 
inscribed circle for the pentagon and s is the distance from the up” face to 
the ”down” face of the dodecahedron. 


Exercise 4.8.1 Supposer = 1. Find s. (This is fairly hard - see the chapter 
on Platonic solids for some ideas.) 


The up face we label as f;. The others may be labeled according to the 
following graph, where faces are represented by vertices and two vertices are 
connected by an edge if the corresponding faces are neighboring.(see Fig. 
4.15) 

A more symmetric way to order the faces of the dodecahedron is as follows 


(see [B], exercise 18.35): 


fi U 
fo | Uo 
fs U4 
ta U2 
fs U3 


fo | U4 
fr | de 
fs | ds 
Io | dg 
fiz | d 
fu | & 


fio do 
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Figure 4.15: 


One property of this labeling is explained in the following 


Exercise 4.8.2 Suppose that the permutation (0 1 2 3 4) of the numbers 
{0,1,2,3,4} acts on the labels uo,...,ua and do,...,d4 in the obvious way. 
Show that this permutation of the faces corresponds to a rotation of the do- 
decahedron. 


Notice that, like the cube, each vertex is uniquely determined by speci- 
fying the three faces it has in common. We use the notation z.y.z for the 
vertex of the dodecahedron which lies on the three faces x,y,z. Note that 
the order is irrelevent: x.y.z denotes the same vertex as y.@.z or z.y.a. 

The facets of the megaminx may be specified as with the Rubik’s cube: 
a corner facet may be specified as [x.y.z], where x is the face the facet lives 
on and y, z are the two neighboring faces of the facet. An edge facet may 
be specified by [x.y], where x is the face the facet lives on and y is the other 
neighboring face of the facet. The center facet of f,; will simply be denoted 
by [fi]. We will call this label the intrinsic label. 


We may label the facets of the up face f; as follows: 
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fi facet symbol | numerical label | intrinsic label 


a 
ah 
os 
w 


Oo oOn DD oF KP WLW db 
=> 
oS 
ran 


Ao ese Te oa SY 


ro 
re @ 
me 
oP 


For the next face (the fg face), we label the facets in such a way that the 
abc edge of f; joins the ghz edge of fa: 


fo facet symbol | numerical label | intrinsic label 
a 12 fo-fe-t7 
b 13 fofr 
Cc I4 fo-fr-fs 
d 15 So-fs 
e€ 16 fo.fs-fs 
f 17 fofs 
& 18 fo-fa-A 
h 19 fof 
1 20 Si-fs-fo 
J 21 Fo-fe 
k 22 tf 


In general, we can label the remaining facets in such a way that the 
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basic moves are, as permutations, given by: 


fr =(135 7 9)(2 46 8 10)(20 31 42 53 64)x 
x(19 30 41 52 63)(18 29 40 51 62) 
fo = (12 14 16 18 20)(13 15 17 19 21)(1 60 73 84 31)x 
(3 62 75 86 23)(2 61 74 85 32) 
fs = (23 25 27 29 31)(24 26 28 30 32)(82 95 42 3 16)x 
x (83 96 43 4 17)(84 97 34 5 18) 
fa = (34 36 38 40 42)(35 37 39 41 43)(27 93 106 53 5)x 
(28 94 107 54 6)(29 95 108 45 7) 
fs = (45 47 49 51 53)(46 48 50 52 54)(38 104 117 64 7)x 
x (39 105 118 65 8)(40 106 119 56 9) 
fs = (56 58 60 62 64)(57 59 61 63 65)(49 115 75 20 9)x 
x(50 116 76 21 10)(51 117 67 12 1) 
fr = (67 69 71 73 75)(68 70 72 74 76)(58 113 126 86 12)x 
x(59 114 127 7 13)(60 115 128 78 14) 


fs = (78 80 82 84 86)(79 81 83 85 87)(71 124 97 23 14)x 
x(72 125 98 24 15)(73 126 89 25 16) 
fo = (89 91 93 95 97)(90 92 94 96 98)(80 122 108 34 25)x 
x(81 123 109 35 26)(82 124 100 36 27) 
fio = (100 102 104 106 108)(101 103 105 107 109)x 
x(91 130 119 45 36)(92 131 120 46 37)(93 122 111 47 38) 
fur = (111 113 115 117 119)(112 114 116 118 120)x 
(102 128 67 56 47)(103 129 68 57 48)x 
(104 130 69 58 49) 
fio = (122 124 126 128 130)(123 125 127 129 131)x 
x(100 89 78 69 111)(101 90 79 70 112)(102 91 80 71 113) 


4.9 Other permutation puzzles 


I have left out several puzzles: ”topspin” and ”turnstile” (planar puzzles), 
*mozaika” (an equator-like puzzle, but the hemispheres may be rotated inde- 
pendently), ”alexander’s star” (a stellated icosahedron), ”the ”impossiball” 
(a spherically shaped icosahedron - see [H]), ”mickey’s challenge” (a spher- 
ically shaped irregular polyhedron - essentially the same as the skewb but 


with some added orientations of faces). 
The puzzle ”Christoph’s jewel”, essentially a ”Rubik octahedron”, may 


be solved using ”super-Rubik’s cube moves” (see [H]). (Indeed, one may take 
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a Rubik’s cube, strip off all the stickers (using soap and water), and replace 
them with new stickers modeling a Rubik octahedron. This is because the 
octahedron is the dual solid of the cube, as described in chapter 7 below 
”Symmetry groups of the Platonic solids” .) 

The ”orbix” puzzle (a battery run puzzle which has 12 buttons which light 
up) is a permutation puzzle if you think of a move (which switches certain 
of the buttons on/off) as permuting the elements of the set of all subsets of 
the 12 buttons (the subset of buttons which are lit) amongst themselves. 

There is some mention of such puzzles in, for example, [Si], [H], [B], [GT] 
and [Jwww]. 


Chapter 5 
Groups, I 


Q: ”What’s commutative and purple?” 
A: ” An abelian grape”. 
- Ancient Math Joke 


”In 1910 the mathematician Oswald Veblen and the physicist 
James Jeans were discussing the reform of the mathematical cur- 
riculum at Princeton University. ‘We may as well cut out group 
theory,’ said Jeans. ‘That is a subject which will never be of any 
use to physics.‘ It is not recorded whether Veblen disputed Jeans’ 
point, or whether he argued for the retention of group theory on 
purely mathematical grounds. All we know is that group theory 
continued to be taught. And Veblen’s disregard for Jeans’ advice 
continued to be of some importance to the history of science at 
Princeton. By the irony of fate group theory later grew into one 
of the central themes of physics, and it still dominates the think- 
ing of all of us who are struggling to understand the fundamental 
particles of nature.” 

Freeman J. Dyson 


SCIENTIFIC AMERICAN, Sep, 1964 


When we studied permuation puzzles in Chapter 4, recall that one of 
the criteria was that each move was invertible”. This is, in fact, one of 
the conditions for the set of all legal moves of a permutation puzzle to form 
a group. A group is a set G with a binary operation (namely a function 
*:G x G —) satisfying certain properties to be given later, one of which is 


87 


88 CHAPTER 5. GROUPS, I 


that each element has an inverse element associated to it. One should be a 
little careful, since not every permutation puzzle gives rise to a group in this 
way. For example, the set of moves of the 15 puzzle do not form a group in 
this way though the set of moves of the Rubik’s cube group do. 

Just as for sets, we must decide on how to describe a group. If G is 
finite then one way is to list all the elements in G' and list (or tabulate) 
all the values of the function *. Another method is to describe G in terms 
of some properties and then define a binary operation * on G. A third 
method is to give a ’presentation” of G. Each of these has its advantages and 
disadvantages. We shall eventually introduce all three of these approaches. 

First, we start with an example. 


5.1 The symmetric group 


Before defining anything, we shall provide a little motivation for some general 
notions which will arise later. 
Let X be any finite set and let Sx denote the set of all permutations of 
X onto itself: 
Sy ={f:X > X | f is a bijection}. 


This set has the following properties: 


1. if f,g belong to Sy then fg (the composition of these permutations) 
also belongs to Sy, ("closed under compositions” ), 


2. if f,g,h all belong to Sy then (fg)h = f(gh), associativity” ), 


3. the identity permutation J: X — X belongs to Sy ("existence of the 
identity” ), 


4. if f belongs to Sy then the inverse permutation f~' also belongs to Sy 
("existence of the inverse” ). 


The set Sx is called the symmetric group of X. We shall usually take for 
the set X a set of the form {1,2,...,2}, in which case we shall denote the sym- 


metric group by S,. This group is also called the symmetric group on n letters. 


Example 62 : Suppose X = {1,2,3}. We can describe Sx as 


Sy = {I,5, = (1 2), 82 = (2 3), 53 = (1 3 2),54 = (1 2 3),85 = (1 3)}. 


5.2. GENERAL DEFINITIONS 89 


We can compute all possible products of two elements of the group and tab- 
ulate them in a multiplication table is 


Exercise 5.1.1 Verify the four properties of Sx mentioned above. (Note 
that the verification of associativity follows from the associative property of 
the composition of functions - see the Exercise 3.0.4). 


5.2 General definitions 


We take the above four properties of the symmetric group as the four defining 
properties of a group: 


Definition 63 Let G be a set and suppose that there is a mapping 


*:GxGxG 
(91,92) > 91 * 2 


(called the group’s operation) satisfying 
(G1) tf 91,92 belong to G thn gi * go belongs to G (”G is closed under *”), 
(G2) tf 91,92, 93 belong to G then (gi *g2)*93 = g1*(g2*ga) (“associativity”), 


(G3) there is an element 1 € G such thatl*g =g*1=g forallg eG 
(“existence of an identity”), 


4) of g belongs toG then there is an element g—~ € G, called the inverse _o 
G4) if g bel G then th ' eG, called th fg 


such that gx g7' =g7'*g =1 (“existence of inverse”). 


Then G (along with the operation *) is a group. 
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Example 64 Actually, this is a *non-example”. Let S be the set of all legal 
moves (one can eventually make from a legally obtained position) of the 15 
puzzle (as described in Chapter 4). In a given position, for ecample the solved 
position, there arent that many possibilities: there are only 2 moves in the 
solved position and there are never any more that 4 moves possible from any 
position. 

From the solved position one can move (15,16) and (12,16) (where 16 
denotes the blank square) but not for example (1,16). Since (15, 16), (1,16) € 
S and since (1,16)(15, 16) is not a legal move, it follows that composition of 
legal moves is not always legal. This shows that composition is not a binary 
operation, so property number (G1) fails to hold. 


In the above definition, we have not assumed that there was exactly one 
identity element 1 of G' because, in fact, one can show that if there is one then 
it is unique. (To do this you need to use the cancellation law: if a*c = b*c, 
where a,b,c € G, then a = 6.) Likewise, if G is a group and g € G then the 
inverse element of g is unique. There are other properties of a group which 
can be derived from (G1)-(G4). We shall prove them as needed. 

The multiplication table of a finite group G is a tabulation of the values 
of the binary operation *. Let G = {g1,...,g,}. The multiplication table of 
G is: 


© lg ga we Gj we Gn 
fi 

g2 

Gi Gi * Gy 

Gn 


Some properties: 


Lemma 65 (a) Each element gy, € G occurs exactly once in each row of the 
table. 

(b) Each element g, € G occurs exactly once in each column of the table. 

(c) If the (2,7) entry of the table is equal to the (j, 7)" entry then g;*g; = 
G5 * Gi- 

(d) If the table is symmetric about the diagonal then g*h=hx*g for all 
g,h €G. (In this case, we call G abelian.) 
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Example 66 Let Cy, be the group whose elements are {0,1,...,11} and for 
which the group operation is simply “addition mod 12”, just as one adds time 
on a clock (except that we call 712 0’clock” ”0 o’clock”). Thus 5+8 = 1, 
1+11=0, and so on. 

Question: What is the inverse element of 5? The inverse of 1? 

This group is called the cyclic group of order 12. 


Exercise 5.2.1 Compute the multiplication table for Cy. 


Definition 67 Let n > 1 be an integer and let C,, be the group whose el- 
ements are {0,1,...,2 —1} (more precisely, {0,1,...,.2 —1}, where i is the 
residue class mod n of t) and for which the group operation is simply “addi- 
tion modn”. This group is called the cyclic group of order n. 


For further details on cyclic groups, see for example [G] or [R]. 


Definition 68 Let g and h be two elements of a group G. We say that g 
commutes with h (or that g,h_commute) ifg*h = h*g. We call a group 
commutative (or “abelian”) if every pair of elements g,h belonging to G 
commute. If G is a group which is not necessarily commutative then we call 
G noncommutative (or “nonabelian”). 


Example 69 The integers, with ordinary addition as the group operation, 
is an abelian group. 


Exercise 5.2.2 Show that any group having exactly 2 elements is abelian. 


Now the reader should understand the punchline to the joke quoted at 
the beginning! 

Convention: When dealing with groups in general we often drop the 
* and denote multiplication simply by juxtaposition (that is, sometimes we 
write gh in place of g*h), with one exception. If the group G is abelian then 
one often replaces * by + and then + is not dropped. 

Now that we know the definition of a group, the question arises: how 
might they be described? The simplest answer is that we describe a group 
much as we might describe a set: we could list all its elements and give the 
multiplication table or we could describe all its elements and their multipli- 
cation in terms of some property from which we can verify the four properties 
of group. Though the first way has the distinct advantage of being explicit, it 
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is this second alternative which is the most common since it is usually more 
concise. 

Our objective is to introduce terminology and techniques which enable us 
to analyse mathematically permutation puzzles. The type of groups which 
arise in this context are defined next. 


Definition 70 Let X be a finite set. Let 91, 92,...,gn be a finite set of ele- 
ments of permutations of X (so that they all belong to Sy). Let G be the set 
of all possible products of the form 


G9 = Ty * LQ... * Lp, m > 0, 
where each of the 21,...,2%m is taken from the set {q1,...,gn}. The set G, 


together with the group operation given by composition of permutations, is 
called a permutation group with generators g1,...,gn. We sometimes write 


G=< Piso Gn PO Sx. 
It is not too hard to justify our terminology: 
Lemma 71 A permutation group is a group. 


proof: Let G be a permutation group as in the above definition. We 
shall only prove that each g € G has an inverse, leaving the remainder of the 
properties for the reader to verify. 

The set {g” |n > 1} C Sy is finite. There are ny > 0, no > ny such that 


g™ = 9g". Then g7! = g™7™7! since g- g@7™ 71 = 1. 


Remark 3 The above definition can be generalized: Replace Sx by any group 
S which includes all the generators g,,...,g,. The resulting set G is called 
the group generated by the elements qy, ....Gn- 


Algorithm: 
Input: The generators g1,...,gn (as permutations in Sy), 
Output: The elements of G, 


S= {9155 Gn Ip se In}s 
L=Su{i, 
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for g in 5 do 
for h in L do 
if g*h not in L then L = L union {g*h} endif 
endfor 
endfor 


Note that the size of the list L in the for loop changes after each iteration 
of the loop. The meaning of this is that the if-then command is to be executed 
exactly once for each element of the list. 


Exercise 5.2.3 Verify that permutation group G satisfies the four properties 
of a group (G1)-(G4). 


Definition 72 [fG is a group then the order of G, denoted |G], is the num- 
ber of elements of G. If g is an element of the group G then the order of 
g, denoted ord(g), is the smallest positive integer m such that g” = 1, if it 
exists. If such an integer m does not exist then we say that g has “infinite 
order”. 


Example 73 For example, there is an even permutation of order 42 in Sip, 
for example (1,2)(3,4,5)(6,7,8,9,10, 11,12), and an odd permutation of or- 
der 15 in Sg, for example (1,2,3)(4,5,6,7,8). 

Singmaster [Si] states that the maximal order in the Rubik’s cube group 
is 1260. 


We shall be able to make use of the following fact frequently. 


Theorem 74 (a) (Cauchy) Let p be a prime dividing |G|. There isageG 
of order p. 

(b) (Lagrange) Let n be an integer not dividing |G|. There does not exist 
ag €G of ordern. 


This will be proven a little later. 

As an application of this: we shall see later that the Rubik’s cube group G 
has the property that |G| = 2?73"5°7711. It follows from this and Lagrange’s 
theorem that there is no move of the Rubik’s cube of order 13 but there is 


one of order 11. 
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Exercise 5.2.4 Let X = {1,2,3}. We use the notation of the example 
above. 

(a) Let G be the permutation group with generator 31,G =< 5, >. Verify 
that there are only two elements in G. 

(b) What is the order of 85? 

(c) Let G be the permutation group with generator 83,G =< 83 >. Verify 
that there are only three elements in G. 

(d) Find the order of 83. 

(e) Show that Sy =< 81,8) >. 


Definition 75 /f G is a permutation group G with only one generator then 
we say that G is cyclic. 


Lemma 76 If G =< gq > is cyclic with generator g then |G| = ord(q). 


proof: Let m = ord(g), so g” = 1. We can list all the elements of G as 
follows: 


m1 


Lg, G59 


There are m elements in this list. 


5.2.1 The Gordon game 
Let G be a finite group, written 


G= {Jo = la, 15 In} 


You and your opponent share a set of move tokens, denoted 


M= {n, ws Inds 


and place tokens, denoted 


P= {915-5 In}: 
Rules to play: 


e Players alternate turns. Each turn consists of removing one move token 
and one place token according to the conditions listed below. The first 
person who cannot make a legal play loses. 


Let mp = po = 1 and let 2 = 1. 
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e First player picks any move token m, € M and the place token p,; = 
m, € P. These tokens m, and p, are then removed from M and P, 
resp.. 


e The next player picks any move token m;4, such that p44 = mip; € 
P. These tokens m4, and p;41 are then removed from M and P, resp.. 


e Increment 2 and go to the previous step. 


Example 77 Let 


G =F, = {0,1,2,3,4,5,6}. 


The moves of a game are determined by recording the move tokens. One 
possible game is 

e 4 1 3.2 

012 3 4 5 6 
where the e over the identity element 0 of the group indicates that it isn’t 
moved and the numbers above a group element indicates when it was moved: 


1%: myz= 2, A= 2; 24; mg 4, p2= 6; 
1%: m3 = 6, P3 = 5; 24; ma = 3, pa = 1; 


2"4 ylayer wins 
Exercise 5.2.5 Play a game! 


Remark 4 I[fG = Z/pZ (the cyclic group with p elements) there is a conjec- 
ture that the 2"¢ player has a winning strategy when p > 5 (see Isbell’s note 
[I]). In general, strategies are not not only not known, they haven't even been 
conjectured. 


Remark 5 [f you and your opponent both try to drag the game on as long as 
possible, can you exhaust the set of move tokens and the set of place tokens? 
The answer is known for abelian groups, dihedral groups and groups of order 
< 32. The general answer is unknown. 
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5.3. Subgroups 


Definition 78 Let G be a group. A subgroup of G is a subset H of G such 
that H, together with the operation * inherited as a subset of G, satisfies the 
group operations (G1)-(G4) (with G replaced by H everywhere). 


Notation: If G is a group then we will denote the statement ”H is a 
subgroup of G” by 
A < G. 


Problem: What are the subgroups of the Rubik’s cube group? It turns 
out that there are too many to list but later, when we have a more useful 
way of describing a group (using generators and relations - see §9.3), we will 
explicitly determine some of the subgroups of ”small” order. 


Theorem 79 (Lagrange) Let H be a subgroup of a finite group G. Then |H| 
divides |G. 


proof: For z,y € G, define x ~ y if eH = yH, where 
tH ={xxh|he Hy}. 


This is an equivalence relation (Exercise: Check reflexive, symmetry, and 
transitivity). Moreover, the equivalence class of x consists of all elements in 
G of the form x*h, for some h € H, ie., [a] = cH. Let g1,...,Gm € G denote 
a complete set of representatives for the equivalence classes of G. Because 
of the cancellation law for groups, |v«H| = |H]| for each x € G. Furthermore, 
we know that the equivalence classes partition G, so 


G= Ur Lgl = Ur gill. 


Comparing cardinalities of both sides, we obtain |G| = |g. H|+...+|gmnH| = 
m|H|. This proves the theorem. 


Definition 80 /f H and G are finite groups and H < G then the integer 
|G'|/|H| ts called the index of H in G, denoted [|G : H] = |G|/|A|. 


Exercise 5.3.1 Show, as a corollary to the previous Theorem 79, that The- 
orem 74 is true. 
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Example 81 A permutation group G generated by elements gy, ..., gn belong- 
ing to Sx is a subgroup of Sx, t.e., G< Sx. 


Example 82 Let 
Ax = {g € Sx | g is even}. 


This is a subgroup of S,, called the alternating subgroup of degree n . 


Definition 83 The center of a group G is the subgroup Z(G) of all elements 
which commute with every element of G: 


Z(G) ={zE€G|z*g= 92, for all g EG}. 


Of course, the identity element always belongs to G. If the identity ele- 
ment is the only element of 7(G) then we say G has trivial center. On the 
other hand, G is commutative if and only if G = Z(G). 


Exercise 5.3.2 Let G= S3. Determine Z(G). 


5.4 Examples of groups 


Example 84 The collection of all moves of the 15 puzzle may be viewed as 
a subgroup of Sy6. 


Example 85 The collection of all moves of the Rubik’s cube may be viewed 
as a subgroup G of Sys. The center of G consists of exactly two elements, 
the identity and the “superflip” move which has the effect of flipping over 
every edge, leaving all the corners alone and leaving all the subcubes in their 
original position. One move for the superflip is 


superflip = R* L* Fh * BxeU* De Re L* F* Be U * F? « Mpx 
4B? * U-! & MZ * B? * My! * B? x U x M2 « D, 


where Mp is middle right slice rotation by 90 degrees (viewed from the right 
face). The proof of this fact uses the determination of the group structure of 
G' given later (see also [B]). 
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5.4.1 The dihedral group 


Pick an integer n > 2 and let R be a regular n-gon centered about the origin 
in the plane. If n = 3 then RA is an equilateral triangle, ifn = 4 then Risa 
square, ifn = 5 then R is a pentagon, and so on. Let G denote the set of all 
linear transformations of the plane ' to itself which preserve the figure R. The 
binary operation 0: G x G — G given by composition of functions gives G 
the structure of a group. This group is called the group of symmetries of R. 


Label the vertices of the n-gon as 1, 2, ..., n. The group G permutes 
these vertices amongst themselves, hence each g € G' may be regarded as 
a permutation of the set of vertices V = {1,2,...,n}. In this way, we may 
regard G as a permutation group since it is the subgroup of 5, generated by 
the elements of G. 

The fact that this group has 2n elements follows from a simple counting 
argument: Let r € G denote the element which rotates R by 2a/n radians 
counterclockwise about the center. Let L be a line of symmetry of R which 
bisects the figure into two halves. Let s denote the element of G which is 
reflection about L. There are n rotations by a multiple of 27/n radians 
about the center in G: 1,r,r?,...,7 There are n elements of G which 
are composed of a reflection about LE and a rotations by a multiple of 27/n 


n-1 


radians about the center: s,sor,sor’,...,sor”'. These comprise all the 
elements of G. 

One remarkable property of this symmetry group, which we shall use in 
the example in the next section, is that it is generated by any two distinct 
reflections in the group: 


Lemma 86 Pick two distinct lines L,L' of symmetries of R, each of which 
bisects R in half, and let s, s' (resp.) denote the corresponding reflections, 
regarded as elements in S,. Then G =< s,s'>. 


The interested reader is referred to [NST], [R], or [Ar], chapter 5, §3, for 
a proof. 
The symmetry group of R is known as the dihedral group of order 2n, 


denoted D2,. We shall state the precise relation in a later chapter (chapter 
8) after we have introduced more terminology. 


‘Tf we regard R as a figure in 3-space centered above the origin and let G denote the 
set of all linear transformations of 3-space then we obtain a slightly larger group in some 
cases [NST]. 
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Example 87 Let G be the symmetry group of the square: i.e., the group of 
symmetries of the square generated by the rigid motions 


go = 90 degrees clockwise rotation about O, 
g1 = reflection about 4, 
g2 = reflection about @2, 
g3 = reflection about ¢3, 
ga = reflection about ¢4, 


where 1, l2,¢3 denote the lines of symmetry in the picture below: (see Fig 5.1) 


Figure 5.1: 


The elements of G are 


1, 05 95> Go» 91> 925 935 94: 


Let X be the set of vertices of the square. Then G acts on X. 


5.4.2 Example: The two squares group 


This material is based on an idea mentioned in [FS]. 
Let H =< R?,U? > denote the group generated by the two square moves, 
R? and U? or the Rubik’s cube. (The reader with a cube in hand may want 


100 CHAPTER 5. GROUPS, I 


to try the Singmaster magic grip : the thumb and forefinger of the right 

hand are placed on the front and back face of the fr, br edge, the thumb and 

forefinger of the left hand are placed on the front and back face of the uf, ub 

edge; all moves in this group can be made without taking your fingers off the 

cube.) This group contains the useful 2-pair edge swap move (R? * U?)?. 
We can find all the elements in this group fairly easily: 


Hei bel el en el) ee ee 
GPP ee ely i el ad i el ed ely ade} 


Therefore, |H| = 12. Note that 1 = (R? * U*)®, U? = (R? * U’)? * R?, and 
U? * R? = (R? *U*)°. (By the way, this listing without repetition of H by 
successive multiplication by R? then U? may be reformulated graphically by 
saying the ”the Cayley graph of H with generators R?, U? has a Hamiltonian 
circuit”. This interpretation will be discussed in the next chapter.) 

To discover more about this group, we label the vertices of the cube as 


follows:(see Fig. 5.2) 


Figure 5.2: 


The move R? acts on the set of vertices by the permutation (1 4)(2 3) 
and the move U? acts on the set of vertices by the permutation (4 5)(3 6). 
We label the vertices of a hexagon as follows:(see Fig 5.3) 
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Figure 5.3: 


The permutation (1 4)(2 3) is simply the reflection about the line of 
symmetry containing both 5 and 6. The permutation (4 5)(3 6) is simply 
the reflection about the line of symmetry containing both 1 and 2. By a fact 
stated in section 5.4.1, these two reflections generate the symmetry group of 
the hexagon. 


5.5 Commutators 

Definition 88 /fg,h are two elements of a group G then we call the element 
[Ig hl=geheg'*«h" 

then commutator of g,h. 


Not that [g,h] = 1 if and only if g,k commute. Thus the commutator 
may be regarded as a rough measurement of the lack of commutativity. 


Exercise 5.5.1 Let G= 3, the symmetric group on 3 letters. Compute the 
commutators 


[31,52], [82,51]. 
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Exercise 5.5.2 Let R,U be as in the notation for the Rubik’s cube moves 
introduced in the previous chapter. Determine the order of the move [R,U]. 


(Ans: 6) 


Definition 89 (Singmaster [Si]) Let G be the permutation group generated 
by the permutations R,L,U,D,F,B regarded as permutations in S54. The 
Y commutator is the element 


[F,RO)=F*R'* PF l«R. 


The Z commutator ts the element 


[F,R]) = FP * Re Fol * RO. 


Exercise 5.5.3 (a) Find the orders of the Y commutator and the Z commu- 
tator. 


(b) Find the order of [R,[F,U]]. 


Example 90 If x,y are basic moves of the Rubtk’s cube associated to faces 
which share an edge then 
(a) [x,y]? permutes exactly 3 edges and does not permute any corners, 
(b) [x,y]? permutes exactly 2 pairs of corners and does not permute any 
edges. 


Definition 91 Let G be any group. The group G’ generated by all the com- 
mutators 


{lg,h] | g,h belong to G} 


This is called the commutator subgroup of G. 


This group may be regarded as a rough measurement of the lack of com- 
mutativity of the group G. 


Remark 6 We will see later that the group generated by the basic moves 
of the Rubik’s cube - R,L,U,D,F,B - has a relatively large commutator 
subgroup. In other words, roughly speaking “most” moves of the Rubik’s 
cube can be generated by commutators such as the Y commutator or the Z 
commutator. 
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Definition 92 If we repeatedly take commutator subgroups we get a series 
of groups G, G’, G" = (G")', and so on. The derived series of a group G is 
the sequence of subgroups 

wix(Gy! < G< G. 
A group G is called solvable if one of the groups in the derived series is the 
trivial group consisting only of the identity. 


Exercise 5.5.4 Let G be an abelian group. Show that G is solvable. 


5.6 Conjugation 


Definition 93 : [f g,h are two elements of a group G then we call the 
element 
g? =h!*qx*h 


the conjugation of g by h. 


Note that g” = 1 if and only if g,h commute. Thus the conjugates may 
be regarded as a rough measurement of the lack of commutativity. 


Exercise 5.6.1 Show g *[g7!,h7"] = g". 


Exercise 5.6.2 Let G = S3, the symmetric group on 3 letters, in the nota- 
tion of the example above. Compute the conjugations 


Exercise 5.6.3 Let R,U be as in the notation for the Rubik’s cube moves 
introduced in the previous chapter. Determine the order of the move R°. 


(Ans: 4) 


Definition 94 : We say two elements 91,92 of G are conjugate if there is 
an element h € G such that g. = gh. 


It turns out that it is easy to see when two permutations g,h € S, are 
conjugate: they are conjugate if and only if the cycles in their respective dis- 
joint cycle decompositions have the same length when assanged from shortest 
to longest. For example, the elements 


g = (6,9)(1, 3, 4)(2,5, 7,8), h = (1, 2)(3,4, 5)(6,7,8, 9) 
are conjugate. We shall leave the details and the proof for later - see 88.3.1 
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Exercise 5.6.4 Show that the notion of conjugate defines an equivalence 
relation. That is, show that 

(a) any element g € G is conjugate to itself (“reflexive”), 

(b) if g is conjugate to h (g,h belonging to G) then h is conjugate to g 
(“symmetry”), 

(c) if g: is conjugate to gz and gy is conjugate to gz then g, is conjugate 
to gs (“transitivity”). 


Notation: The set of equivalence classes of G under the equivalence 
relation given by conjugation, will be denoted G,. 


pa(t) = Ss" fora) 


gEGx 


The polynomial 


is called the generating polynomial of the order function on G. Note two 
elements which are conjugate must have the same order since (h7~'gh)” = 
(h-*gh)(h-'gh)...(h7'gh) = ho'g"h, for n = 1,2,... and gh EG. 

In [Si], §5.10D, D. Singmaster asks for the possible orders of the elements 
of the Rubik’s cube group and how many elements of each order there are. 
(A method for determining this will be described later in this text.) This 
question of Singmaster motivates the following: 

Problem: Determine pg(t) for the Rubik’s cube group. 


Example 95 For Ss, the generating polynomial is 


447 +42P +4442 450% +4742 4 04 77 44% 


and for Sy it is 


+ 24° + 1646 + t7 + 448 + 20° + 64194 
pzl4 | ppt | p18 | 9420 tt4 
8 3480 $35 ¢?2 780, 


t+ 6t? + 40° + 9t4 
fil gy12 
724 | #2 


(Both of these calculations were performed by MAPLE.) For example, it fol- 
lows that there is an even permutation of order 42 in Sz and an odd permu- 
tation of order 15 in Sg. 

Singmaster [Si] states that the maximal order in the Rubik’s cube group 
is 1260. 
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Definition 96 ; Fix an element g in a group G. The set 
Cl(g) = {hh egeh | heG} 


is called the conjugacy class of g in G. It is the equivalence class of the ele- 


ment g under the relation given by conjugation. 
If H is a subgroup of G' and if g is a fixed element of G' then the set 
Ho ={g'*h*xg|he H} 


is a subgroup of G. Such a subgroup of G' is called a subgroup conjugate to 
H. 


Exercise 5.6.5 Let S be the set of all subgroups of G. We define a relation 
Ron S by 


R= {(M,H2)€E 5x S | A, is conjugate to Hy}. 
Show that R is an equivalence relation. 


Exercise 5.6.6 LetG =), and let H =< g > be a cyclic subgroup generated 
by a permutation g of the set {1,2,....n}. With respect to the equivalence 
relation in the previous exercise, show that a subgroup K of G' belongs to the 
equivalence class |H| of H in G if any only if K is cyclic and is generated 
by an element k of G conjugate tog EG. 


5.7 Orbits and actions 


Definition 97 Let X be a set and let G be a group. We call X a G-set and 
we say G_acts on X_ provided the following conditions hold: 


1. each g belonging to G' gives rise to a function 
og: X aX, 
2. the identity 1 of the group G' defines the identity function on X, 
3. if g,h belong to G then the composite 
Qgn 2 X +X 
satisfies bh (x) = bn(Og(x)). 
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We call this action a left action since the left-most element (namely, g) in 
the product gh acts first. 


Similarly, we define 


Definition 98 Let X be a set and let G be a group. We say G_acts on X_on the right 
provided the following conditions hold: 


1. each g belonging to G gives rise to a function 


oy: X > X, 


2. the identity 1 of the group G' defines the identity function on X, 
3. if g,h belong to G then the composite 
Pgh :X 3X 


satisfies Ogn (x) = g(o,(x)). 


We call this action a right action since the right-most element (namely, h) 
in the product gh acts first. 


Remark 7 (1) We shall see another interpretation of these definitions in 
the later chapter entitled, “Groups, IT”. 
(2) Given a left action oy, one can create a right action by defining ¢, = 


by. 


Following the standard convention, the Rubik’s cube will act on the set 
of facets of the cube on the right. 


Definition 99 Let G act on a set X. We call the action transitive if for 
each pair x,y belonging to X there is ag €G such that y = @,(x). 


In other words, a group G' acts transitively on a set X if any element x 
of X can be send to any other element y of X by some element of G. 


Example 100 Let X be a finite set and let G = Sy be the symmetric group 
of X. Then X is a G-set and G acts transitively on X. 


Exercise 5.7.1 Show that the action in the previous example is transitive. 
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Example 101 Let G' be the group of all 2 x 2 invertible matrices with real 
entries, G = GL2(R). This group acts on the set of column vectors on the 
left. 


Exercise 5.7.2 Let G be the permutation group generated by the permuta- 
tions R,L,U,D,F,B, regarded as elements of S4g. Let E denote the set of 
edges of the cube, which we identify with the set of edge subcubes. Let V 
denote the set of vertices of the cube, which we identify with the set of corner 
subcubes of the cube. Let X be the set of all movable subcubes of the Rubik’s 
cube (which may identify as the union of E and V). Then G acts on X, E 
and G acts on V. 

Question (a) Is the action of G on X transitive? 

(a) Is the action of G on E transitive? 

(b) Is the action of G on V transitive? 


Exercise 5.7.3 Let G be a group and let X = G. Define left multiplication 
of G on X by: 


dg: X 3X 
rr o,(@)=g*x. 
(a) Show that left multiplication defines a left action of G on X. 


(b) Show that this action is transitive. 
(c) Show that each ¢,: G — G is a permutation of the set G, so dy € Sg. 


Exercise 5.7.4 Let G be a group and let X =G. Define right multiplication 
of G on X by: 


dg: X 3X 
rr o,(@) =a *g. 
(a) Show that right multiplication defines a right action of G on X. 


(b) Show that this action is transitive. 
(c) Show that each ¢,:G— G is a permutation of the set G, so oy € Sq. 


Exercise 5.7.5 Let G be a group and let X = G. Define conjugation on X 
by: 
og: X aX 


reod(r)=g  *urg. 


Show that conjugation defines an action of G on X (X and G as above). 
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Exercise 5.7.6 Let G be a group and let X denote the set of all subgroups 
of G. Define conjugation on X_ by: 
Og: X aX 
ps bgt) = gt 0g. 


Show that this defines an action of G on X. 


Remark 8 In general, the actions in the last two exercises are not transitive. 


Definition 102 Let G be a group acting on a set X. For each x belonging 
to X, the set 


Gra ={e,(x) |g € G} 
is called the orbit of x € X under G. 


Algorithm 

Input: A set S of generators of a permutation group G and an x belonging 
to X 

Output: The orbit of «, Gx x 


orbit = {x} 
for y in orbit do 
for g in 5 do 
if g*y not in orbit then orbit = orbit union {g*y} endif 
endfor 
endfor 


Note that the size of the list orbit in the for loop changes after each 
iteration of the loop. As mentioned before, the meaning of this is that the 
if-then command is to be executed exactly once for each element of the list. 


Exercise 5.7.7 Let G be the Rubik’s cube group and let x be the uf edge 
facet. Find the orbit of x under the action of G using the above algorithm. 
Show each step. 


Exercise 5.7.8 Let G be the group of moves of the Rubik’s cube and let X 
be the set of vertices of the cube. Let H be the subgroup of G generated by 
U« R. Find: 

(1) the order of U * R, (Ans: 105) 

(2) the orbit (in the Singmaster notation) of the ufr vertex in X under 
Hl. 
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Definition 103 Let G be a group acting on a set X with the action denoted 
by @. For each x belonging to X, the subgroup 


stabo(x) =G,={g EG | ¢,(x) =a} 


is called the stabilizer of x in G. 


Exercise 5.7.9 Let G be a group acting on a set X, o,: X > X, for all 
g € G. Show that, for alla € X and all g € G, we have stabg(¢,(x)) = 


g * stabe(x) *g7'. 


Example 104 Let G' be the group of symmetries of the square (see the ex- 
ample above), let X be the set of vertices of the square, and let xo be the 
vertex in the lower right hand corner. Then stabg(ao) =< gs >. 


Exercise 5.7.10 Let G be any group and let X = G. Let G act on X by 
left multiplication: 
og: X —X 


rr o(@)=g*x. 
Show that 
stabe(a) = 1, 


for all x belonging to X =G. 


Exercise 5.7.11 Let G be any group and let X = G. Let G act on X by 
conjugation: 
og: X aX 
rr, (r)=g*eueg'. 
Show that 
stabe(a) = {gE G|g*xx=x*g}, 


for all x belonging to X =G. (The subgroup 
Ce(a)={gEG|lgxx=axg} 


is called the centralizer of x in G.) 
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Example 105 Let X be the set of consisting of the 48 facets of the Rubtk’s 
cube which are not center facets - i.e., the “movable” facets. Let V denote 
the subset of facets which belong to some corner subcube, E’ the subset of 
facets which belong to some edge subcube. Let G denote the Rubik’s cube 
group. As noted above, G acts on X, V, E. The action of G on X induced 
an equivalence relation as follows: we say that a facet f, is “equivalent” to 
a facet fy if there is an element of G (i.e., a move of the Rubik’s cube) 
which sends one facet to the other. By exercise 5.7.2, there are exactly two 
equivalence classes, or orbits, of Gin X: V and E. In particular, the action 
of G on V is transitive and the action of G on FE is transitive. 


5.8 Cosets 


Let G be a group and H a subgroup of G. For g belonging to G, the subset 
g * H of G is called a left coset of H in G and the subset H *g of G' is called 
a right coset of H in G. 


Exercise 5.8.1 If H is finite, show |H| = |g * H| = |H * gJ. 


Exercise 5.8.2 If X is a left coset of H in G and x is an element of G, 
show that « * X is also a left coset of H in G. 


Notation: The set of all left cosets is written G/H and the set of all 
right cosets of H in G is denoted H\G. 

These two sets don’t in general inherit a group structure from G but they 
are useful none-the-less. (G'/H is a group with the “obvious” multiplication 
(gi * H) * (go * H) = (glg2) * A if and only if H is a ”normal” subgroup of 
G - we will define ”normal” below.) 

As an example of their usefulness, we have the following relationship 
between the orbits and the cosets of the stabilizers. 


Lemma 106 Let G be a finite group acting on a set X. Then 
IG * «| = |G/stabe(x)], 


for all x belonging to X. 


5.8. COSETS 111 


proof: The map 
g * stabg(x) Hog * a 


defines a function f : G/stabg(x) — Gx «x. This function is a bijection since 
it is both and injection (Exercise: Check this) and a surjection (Exercise: 


Check this). 


Exercise 5.8.3 Let G be the group of symmetries of the square. Using the 
notation above, compute G/ < g3 > and G * xo. 


Theorem 107 (Lagrange): If G is a finite group and H a subgroup then 
|G/H| = |G|/|H]. 


Corollary 108 /f H,G are as above then the order of H divides the order 
of G. 


proof of Theorem: Let X be the set of left cosets of H in G' and let G 
act on X by left multiplication. Apply the previous lemma with x = H. 


Exercise 5.8.4 Let G = $3, the symmetric group on 8 letters, and let H =< 
8, >, in the notation of §5.1 above. 

(a) Compute |G/H| using Lagrange’s Theorem. 

(b) Explicitly write down all the cosets of H in G. 


Definition 109 : Let H be a subgroup of G and let C be a left coset of H 
in G. We call an element g of G a coset representative of C if C = g* H. 
A complete set of coset representatives is a subset of G, 11, 22,...,%m, such 
that 


G/H = {x, * H,...,0%m, * H}, 


without repetition (t.e., all the x; * H are disjoint). 


Exercise 5.8.5 For gi, 92 € G, define g, ~ go if and only if g, and gz belong 
to the same left coset of H in G. 
(a) Show that ~ is an equivalence relation. 


(b) Show that the left cosets of H in G partition G. 
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5.9 Dimino’s Algorithm 


We saw in an earlier chapter an algorithm for computing all the elements of 
a permuation group G. We shall discuss a more efficient algorithm for doing 
this in this section. For more details, see [Bu]. 

Notation: Let S = {, 92,.--; Gn} bea set of generators for a permutation 
group G. Let 


So = 0, 
Si = {91,59}, 
Go = {1}, 


G; =< S; >= the group generated by the elements in S;, 


for |<i<n. 

Algorithm (inductive step): 

Input: The generators S of G and a list L of all the elements of the 
permutation subgroup G;_1. 

Output: A list LZ of elements of G; and a list C’ of coset representatives 
of G;/Gi-1. 


c = {1} 
for g in C do 
for s in S_i do 
if s*g not in L then 
C = C union {s*g} 
L = L union s*g*G_{i-1} 
endif 
endfor 
endfor 


Algorithm (Dimino): 
Input: The generators 5 of G 
Output: A list of elements of G 


(Initial case S_1i = <g1>) 
order = 1, element[1] = 1, g = gl 
while g <> 1 do 

order = order + 1 

element [order] = g 
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g = g*gl 
endwhile 


(General case) 
for 1 from 2 to n do 

<insert inductive step here> 
endfor 


Example 110 Let G = 53 =< 81,52 >. We use Dimono’s algorithm to list 
all the elements of G. We have 


Go = {1} < G, =< 55 >< Gp =G. 

First, we list the elements of G =< 8, >. Since 8, = (1 2), it is order 2, so 

G, = {1,51}. 
This is our list L which we will apply the “inductive step” of Dimino’s algo- 
rithm to (with i = 2). We start with C = {1}. Now we look at the left cosets 
of Gy in G2=G. We have (with g = 1,5 = 81) 

81 *G,=Gi, 
so we dont increase the size of C or L. Next, we have (with g = 1,8 = 82) 

82 * Gy = {52, 82 * 51} # G4, 
so L = {1, 81, 82, 82 * 81},C = {1,52}. Neat, we have (with g = 52,8 = 51) 
81 * 89 * Gy = {51 * 89, 81 * 82 * 54} A Gy. 


(We know 81 * 83 * G, # G, since neither of the two elements in 81 * 52 * Gy 
is the identity.) Thus, we increase L,C: 


LD = {1, 51, 82, 82 * $1, $1 * 82, 81 * 82 * 51}, 


and C = {1, 52,81 * 82}. We know we may stop here since we know |.53| = 6 
but the algorithm still has one more statement to execute. Next, we have 
(with g = 52,8 = 89) 

82 * 89 * Gy = Gi, 
so we dont increase the size of C or L (as expected). This step terminates 
the algorithm and S3 = L. 
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Exercise 5.9.1 Perform Dimino’s algorithm on 


S4a< s,=(1 2), 82 = (2 3), 53 = (3 4) >. 


5.10 Permutations and campanology 


This section is based on a capstone project of S. Robinson [Rob]. 

Standing outside of Westminster Abbey as the bells chime, the result you 
hear may actually be much more mathematical, than musical. While your 
ears think they detect melody,they are being deceived. The bells are not be- 
ing rung in melody at all; in actuality, they are being rung in permutations 
((Wh], p771). Since the seventeenth century, and possibly before, cathedral 
bells in England have been rung by such permutations or changes ([Wh], 
p771). The art and study of such bell ringing is referred to as campanology. 
While campanology had been around for at least a century before the for- 
malization of what is now known as group theory, elements of group theory 
are implicit in campanology. 

Fabian Stedman, referred to as “one of the “fathers of bell ringing,” is 
conjectured by Arthur White, to have been, perhaps, the first group theorist 
((Wh], p771). Born in 1640 to Reverend Francis Stedman, Fabian Stedman’s 
connections with campanology took root at the early age of 15 when he moved 
to London to work as an apprentice to a Master Printer. While in London 
Stedman joined a bell-ringing society known as the Scholars of Cheapside 
and served as the societies treasurer in 1662. In 1664, Stedman went on to 
join another bell-ringing society known then as the ”Society of Colledg(sic) 
Youths,” which has since been renamed the Ancient Society of College Youths 
and is still in existence today ([Wh], p771). Stedman remained with the 
society becoming Steward of College Youths in 1677 and eventually in 1682 
Master of the Society. Stedman’s major contributions to campanology are 
reflected in his efforts on Tintinnlogia and Campanalogia, the first two books 
published on the subject, in 1668 and 1677, respectively ([Wh], p771). 

To leap right into a discussion of Stedman’s work, campanology, or group 
theory even in their most general terms without some cursory definitions 
would be futile. Below is a glossary of a few essential terms: 


e Transposition: a cycle (i,j) of length 2 which interchanges i and j 


e Change: the swapping of one or more disjoint pairs of adjacent bells 
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e Plain change: involves swapping one pair of adjacent bells only 
e Cross change: involves more than one swapping pair of bells 
e Round: A unique ordering of the bells (i.e. (1,2,3,...,7)) 


The following picture was taken from [Wa], page 71.(see Fig 5.4) 


Ring of eight bells in modern iron frame (Warner London) 


Figure 5.4: 


In the beginning, change ringing concerned itself with a single row of 
bells whose order could be denoted by (1, 2,3,...,n). Considering the case 
where n = 6 the concepts of plain and cross changes can be understood more 
clearly. If we use only plain changes we can generate permutations of the 
bells as follows: 


6 
6 
6 


Doh br 
rR Fr ho 
ee WwW Ww 
Wow eR 
DH Ot Ot Ot 


9, 


It should be fairly obvious on inspection that the first plain change swaps 
1 and 2, the second swaps 3 and 4, and the third swaps 5 and 6. Considering 
the same set of six bells acted upon by a cross change, the same result is 
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achieved in one change, as seen below: 
123 4 5 6 
21 4 3 6 5. 
More useful and interesting patterns can be generated by combining plain 
and cross changes. The plain lead on four bells is one of the most simplistic 
patterns and was devised sometime around 1621 by alternating consecutive 


cross and plain changes as seen below: 


er wwe FM Ne 
NNW Bwnw ke pp 
WNRHE DWH BW 
Bee po wmorr wwik 


It is easy to see that the pattern which defines the plain lead on four bells is 
nothing more than a cross change followed by a plain change on the middle 
two bells until we reach the round, which is where we started. Generat- 
ing the plain lead on four bells is analogous algebraically to generating the 
dihedral group on four elements, D4. We begin by representing the cross 
change as a = (1,2)(3,4) which swaps the first two and last two bells and 
representing the plain change as 6 = (2,3) which swaps the middle pair. 
Algebraically, D4 is generated by multiplication. We begin with the first 
element in the group, a. To generate the next element in the group we 
multiply this first element by 6. To generate the third element we sim- 
ply multiply this second term, ab, by a to get aba. Continuing on in this 
manner we multiply alternately by a then 6 to generate the dihedral group 
Dy = {a,ab, aba, (ab)?, a(ab)’, (ab)*, (ab)%a, (ab)*}. Since (ab)* yields the 
round, we say (ab)* = 1 and D, = {1,a,ab, aba, (ab), a(ab)?, (ab)°, (ab)?a}. 
While this simple example illustrates the implicit elements of group theory 
which seem to be at the heart of bell ringing, moving on to a more complex 
example illuminates perhaps more significant implications. 

We turn our attention now to the composition which is commonly referred 
to as Plain Bob Minimus. Plain Bob Minimus begins at the round and ends at 
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the round (1, 2,3, 4) and contains all possible permutations of these four bells. 
Calling the earlier definitions to mind, it should be evident that generating 
the Plain Bon Minimus composition is equivalent algebraically to generating 
the symmetric group on 4 elements, $4, which is shown: 


1234 1342 1423 
21483 3124 4132 
2413 3214 4312 
4231 2341 3421 
4321 2431 3241 
3412 4213 2314 
3142 4123 2134 
1324 143 2 12483 

1234 


We can now analyze this composition as we did Dy. We begin first by let- 
ting a = (1,2)(3,4) and 6 = (2,3) represent possible changes between rows. 
If we look at the first column of the Plain Bob Minimus composition, we see 
that it is nothing more than the dihedral group, D4, which is a subgroup of 
54. To generate the second column of $4 we introduce a c = (3,4) and we 
simplify our notation by letting k = (ab)°ac. Multiplying through we gen- 
erate the second column, {k, ka, kab, kaba, k(ab)?, ka(ab)?, k(ab)?, k(ab)?a}. 
Almost immediately we should realize that this is the left coset kD4. Em- 
ploying ¢ again to obtain the third column yields k?D4, which is the final 
left coset since multiplication by c a third time brings us to rounds. The 
generation of the Plain Bob Minimus shows that 54 can be expressed as the 
disjoint union of cosets of the subgroup D4, that is, also stated, the cosets 
of D4 in Sq partition $,. There is an important generalization of this fact, 
which states: 


Theorem 111 For any group G and any subgroup H, the cosets of H inG 
partition G. 


(See Exercise 5.9.5 above.) 

Now, since we chose a = (1, 2)(3,4), 6 = (2,3), and c = (3,4), where 6 and 
c are obviously by definition 2-cycle or transpositions and a is the product 
of two such 2-cycles or transpositions, we have shown a further result, that 
each element of 54 can be written as a product of 2-cycles. More generally, 
we can state the following theorem: 
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Theorem 112 Let f be a member of S,, i.e., let f be any permutation of 
degree n. Then f can be written as a product of transpositions. 


To sketch a proof of this theorem (following [G]) and hence prove Theorem 
58 as promised, we need only to recall that: Every permutation of 5, can be 
written uniquely (up to order) as a product of disjoint cycles (Theorem 53 
above). 
Note that any cycle can be written as a product of transpositions as 
below: 
(a1, 42, ...,@) = (41, G4) (1, Ap_1).--(@1, a2). 


We see that since any permutation can be written in terms of cycles and 
any cycle can be written as product of transposition, it follows that every 
permutation of $, can be written as a product of transpositions. 

Considering both the plain lead on four bells and the Plain Bob Minimus 
composition, it is obvious that group theory is latent in the study of cam- 
panology. As White concludes in his essay, he is not suggesting “that Fabian 
Stedman was using group theory explicitly, but rather that group theoretical 
ideas were implicit in (Stedman’s) writings and compositions” ({Wh], p778). 
Whether we can consider Stedman the first group theorist, then, is unclear; 
what is clear, however, is that when we hear the bells of Westminster Abbey 
chime, we are hearing not just melody but mathematics. 


Chapter 6 


Graphs and ”God’s algorithm” 


”_..O, cursed spite, 
that ever I was to set it right!” 
Hamlet, Act 1, scene 5 


In this chapter we introduce a graphical interpretation of a permutation 
group, the Cayley graph. This is then interpreted in the special case of a 
group arising from a permutation puzzle. 

To begin, what’s a graph? A graph is a pair of countable sets (V, ), 
where 


e V is a countable set of singleton elements called vertices, 
e F is a subset of unordered pairs {{v1, v2} | v1, v2 € V} called edges. 


A graph is drawn by simply connecting points representing vertices to- 
gether by a line segment if they belong to the same edge. 
A digraph, or directed graph, is a pair of countable sets (V, FE), where 


e V is a countable set of vertices, 
e F is a subset of ordered pairs {(v1, v2) | v1,¥2 € V} called edges. 


A digraph is drawn by simply connecting points representing vertices 
together by an arrow if they belong to the same edge (v1, v2), the araow 
originating at v, and arrowhead pointing to v2. 
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If e = {v,, v2} belongs to EF then we say that e is an “edge from vy, to 
v2” (or from v2 to v,). If v and w are vertices, a path from v to w is a finite 
sequence of edges beginning at v and ending at w: 


€9 = {v, U1}, ey = {1,02}; seg On = {Un w}. 
If there is a path from v to w then we say v is connected to w. We say that a 
graph (V, F) is connected if each pair of vertices is connected. The number 


of edges eminating from a vertex v is called the degree (or ”valence”) of v, 
denoted degree(v). 


Example 113 : /f 
V= {a,b,c}, B= {{4, b}, {a,c}, {b, c}}, 
then we may visualize (V, E) as 
/\ 


/ \ 


a* --- * b 


Each vertex has valence 2. 


Definition 114 : [fv and w are vertices connected to each other in a graph 
(V,E) then we define the distance from v to w, denoted d(v,w), by 


d(v,w) = min #{edges in a path from v to w} 


vywEV connected 


By convention, if v and w are not connected then we set d(v,w) = oo. The 
diameter of a graph is the largest possible distance: 


diam((V, #)) = max d(v, w). 


VWwE 


In the above example, the diameter is 1. 
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6.1 Cayley graphs 
Let G' be a permutation group, 
G=< G15 G2, Gn >< Sx. 


The Cayley graph of G with respect to X = {g1, ga, .-.59n} is the graph (V, F) 
whose vertices V are the elements of G' and whose edges are determined by 
the following condition: if « and y belong to V = G then there is an edge 
from x to y (or from y to x) if and only if y = g; * x or « = g; * y, for some 
@=1,2,...,n. 

The Cayley digraph of G with respect to X = {91, 92, ---, Gn} is the digraph 
(V, E) whose vertices V are the elements of G and whose edges are determined 


by the following condition: if x and y belong to V = G' then there is an edge 
from x to y if and only if y = x * g;, for some 2 = 1, 2,...,n. 


Exercise 6.1.1 Show that the Cayley graph of a permutation group is con- 
nected. 


Lemma 115 Let lg = (V,E) denote the Cayley graph associated to the per- 


mutation group G =< G15 G2. ++ Gn >. Let N= Hn. 9 392,955 ws Ins In S|: 
Then, for allv € V, degree(v) = N. 


proof: Assume not. Then there is a v € V = G with either 

(i) degree(v) < N, or 

(ii) degree(v) > N. 

First, we note that, for each h € {91,97', 92,93 5+59n,9,'}, the set 
{v, h * v} is an edge of Tg. This follows from the definition of the Cayley 
graph. 

If r = degree(v) > N then, by definition of the Cayley graph, there are 
distinct v,...,0, € V with v = h; * v;, for all 1 <2 <r, where the fy,..., A, 
are distinct elements of {91,97 1, 92,9) 5 -:9n:g,'}. This contradicts the 
definition of NV. 

If r = degree(v) < N then, by definition of the Cayley graph, there are 
distinct hj, hj in {91,971 92593 5 -59nsg,'} such that hj *v = h; * v. Since 
G is a group and V = G (as sets), we may cancel the v’s from both sides of 
the equation h; *v = h; * v, contradicting the assumption that h,; is distinct 
from h,. 
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Example 116 : Let 
G =< 81,8) >= $3, 


where 8; = (1 2), and 52 = (2 3). Then the Cayley graph of G with respect 
to X = {81,52} may be visualized as (see Fig. 6.1) 


Cayley graph of 34 


Figure 6.1: 


Example 117 : Let 
G =< 84,59 S= $3, 


where 8; = (1 2), and sy = (2 3). Then the Cayley digraph of G with respect 
to X = {81,52} may be visualized as (see Fig. 6.2) 


Exercise 6.1.2 Construct the Cayley graph of C4, the cyclic group, with 
respect to the generator s = (1,2,3,4). 


Exercise 6.1.3 Construct the Cayley graph of S4, the symmetric group on 
four letters, with respect to the generators s; = (1 2), 52 = (2 3) and 53 = 
(3 4). 


Exercise 6.1.4 Construct the Cayley digraph of 5'3 with respect to the gen- 
erators f = (1,3), r = (1,2,3). (Show, in particular, that f,r do indeed 
generate 53.) 
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Cayley digraph of 33 


with generators $18 


2 


Figure 6.2: 


Example 118 : Let 
G=< R,L,U,D,F,B> < Sra 


be the group of the 3 x 3 Rubtk’s cube. Each position of the cube corresponds 
to an element of the group G (i.e., the move you had to make to get to that 
position). In other words, each position of the cube corresponds to a vertex 
of the Cayley graph. Each vertex of this graph has valence 12 


Exercise 6.1.5 Check this. 


Moreover, a solution of the Rubik’s cube is simply a path in the graph 
from the vertex associated to the present position of the cube to the vertex 
associated to the identity element. The number of moves in the shortest pos- 
sible solution is simply the distance from the vertex associated to the present 
position of the cube to the vertex associated to the identity element. The di- 
ameter of the Cayley graph of G is the number of moves in the best possible 
solution in the worst possible case. 


6.2 God’s algorithm 


Problem: Let G be the group of a permutation puzzle. Find the diameter 
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of the Cayley graph of G. 

This problem is unsolved for must puzzles (including the 3 x 3 Rubik’s 
cube) and appears to be very difficult computationally. The cases where it 
is known include (with no attempt at completeness) the following: 


puzzle diameter 
pyraminx 11 (not including tip moves) 
2 x 2 Rubik’s cube 14 


For the 2 x 2 Rubik’s cube, see [CFS]. 

Problem: Let G' be the group of a permutation puzzle and let v be a 
vertex in the Cayley graph of G. Find an algorithm for determining a path 
from v to the vertex vp associated to the identity having length equal to the 
distance from v to vp. 

This problem is much harder. The algorithm, if it exists, is called God’s algorithm. 
A good reference for recent progress on God’s algorithm for various Rubik’s 


cube-like puzzles may be found on Mark Longridge’s www page [Lo]. 
Exercise 6.2.1 Find the Cayley graph of the “sliced squared” group 
G =< M2, M2,M3 >, 


where Mp is the middle slice move which turns the middle slice parallel to 
the right face clockwise 90 degrees (with respect to the right face). Find the 
diameter of this graph. 


Let T be a graph. A Hamiltonian circuit on T is a sequence of edges 
forming a path in P which passes through each vertex exactly once. (If you 
think of the vertices as cities and the edges as roads then a Hamiltonian 


circuit is a tour visiting each city exactly once.) 

The following unsolved problem was first mentioned in this context (as 
far as I know) by A. Schwenk: 

Problem: Let G be the group of the 3 x 3 Rubik’s cube puzzle. Does the 
Cayley graph of G have a Hamiltonian circuit? In other words, can we (in 
principle) ”visit” each possible position of the Rubik’s cube exactly once, by 
making one move at a time using only the basic generators R,L,U,D,F,B? 

This is a special case of a more general unsolved problem: For an arbitary 
permutation group with more than two elements, it is not known if the Cayley 
graph is Hamiltonian [CG]. 

An example of one where it is known is the following: 
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Example 119 Let G be the group S,, with generators given to be the set of 
all transpositions: 


G=S,, X={(i,j)|1Si<j <n}. 


(There are many more transpositions than necessary to generate S,, since 
the subset of transpositions of the form (i,i+ 1), 1<ai<n-—1, suffice to 
generate S,, [R].) The algorithm of Steinhaus (see §3.3) shows that there is 
a Hamiltonian circuit in the Cayley graph of S, with respect to X. 


The reader interested in more examples is referred to [CG]. 


6.2.1 The Icosian game 


Sir Wiliam Hamilton, an Irish prodigy of the 18th century, may have origi- 
nated the problem of finding Hamiltonian paths (hence the name) by patent- 
ing a game called the Icosian game or the Hamilton game. The idea is to find 
a Hamiltonian path around the vertices of the icosahedron. A picture (from 
the MacTutor History of Mathematics archive [OR]) of the orginal game is: 
(see Fig. 6.3) 


Figure 6.3: 
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6.3. The graph of the 15 puzzle 


This section, which is based on [Mc], discusses the 15 puzzle from the graph- 
theoretical point of view following [W]. 

The 15 puzzle was introduced in §4.1 above. The object of the puzzle 
was to order the pieces from one to fifteen from left to right, top to bottom, 
as shown in the solved position: (see Fig. 6.4) 


Figure 6.4: 


To solve a mixed up puzzle, one would slide the squares around in the 
puzzle. In order to do this you must slide a numbered square into the place 
of the space. We could represent this mathematically by saying that this is 
a transposition of that numbered square and the blank. 

If we label each space in the puzzle in the above Figure, as a vertex, and 
label the vertices numerically, then the resulting graph is represented by (see 
Fig. 6.5) 

We will let 16 denote the blank and call this graph [. The only legal 
moves of the puzzle are transpositions of the 16th vertex and a vertex that is 
adjacent to it. Therefore, any permutation of the vertices produces a labeling 
on I. 
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Figure 6.5: 


6.3.1 General definitions 


Now let T be a simple graph with the vertex set V(I) of cardinality N. 
(In the above example N = 16.) By a labeling we mean the placement of 
the numbers one through N on distinct vertices of , where N denotes the 
blank. In other words, a labeling on [ is a bijective mapping f : V(T) > 
{1,2,...,N}. Two labelings f,g on I are adjacent if and only if g is a result 
of a single transposition on f of vertex N with a vertex adjacent to N on f. 
In other words, f and g are adjacent if they differ by one legal move of the 
puzzle. From [, we make a new graph puz(T) as follows [W]: the vertex set 
V(puz(L)) contains all labelings on I’, and two vertices in puz(I) are joined 
by an edge if the associated labelings are adjacent. 

For example, the labelings below are adjacent(see Fig. 6.6 and Fig. 6.7) 


We can consider a sequence of moves on I’ to be a path p, such that 


p = (Xo, %1,%2,-..,U,) where the xis are vertices of T, and x; and 2;_; are 
adjacent. Such a path p is said to be from 2g (its initial vertex) to x, (its 
terminal vertex). The path p is simple if xo, 1, v2,...,U%, are distinct. If xo = 


x, then (a not necessarily simple path) p is called a closed path based at xo. 
Let xo be a fixed vertex of [. The set of paths based at xo forms a group 
(under composition of paths) called the homotopy group of [ based at xo, 
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Figure 6.6: 


Figure 6.7: 
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denoted F(a). 
Now suppose we paint the blocks of the 14-15 puzzle in a checkerboard 
pattern: (see Fig. 6.8) 


Figure 6.8: 


In this arrangement the blank would start on a white square. If we were 
to move the blank up, then it is now on a black square. That is one transpo- 
sition, therefore the movement is odd. If we then move the blank to the left, 
the blank would be on a white square. This is a total of two transpositions; 
therefore, the movement is even. After three transpositions the blank would 
be on a black square, therefore it would be an odd permutation. Therefore if 
the blank ends on a white square, an even permutation has occurred. If the 
blank ends on a black square, an odd permutation has occurred. 

A legal position of the 15-Puzzle is any sequence of legal transpositions 
starting from the solved position such that the blank ends up in the bottom 
right-hand corner. Each such position corresponds to a permutation of the 
15 numbered vertices and hence to an element of the symmetric group S45. 
The set of all such permutations (arising as a sequence of transpositions) in 
Sis forms a group called the group of the 15-Puzzle. 


Note that the group of the 15-puzzle is isomorphic to the homotopy group 
of the 15 puzzle graph based at the ”blank” vertex. 
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If we assign the number 16 to the blank, then we can see that we can 
arrange the pieces of the puzzle in 16! different ways. However, if we take 
only legal positions of the 15-Puzzle, then we are fixing one of the pieces. 
As a result the number of ways to permute the rest of the pieces, with the 
blank on the white square at the bottom right-hand corner, is at most 15!. 
All such permutations have to be even, by the checkerboard analysis above. 
15!/2 is the number of even permutations of 15 elements (there are an equal 
number of even and odd permutations). From this we see that the 15-puzzle 
has 15!/2 possible legal positions. 


Theorem 120 The positions with the empty space at the bottom right that 
can be reached from the start position of the 15-Puzzle by shifting tiles are in 
a bijective correspondence with the 15!/2 = 1,307, 674, 368,000 even permu- 
tations of the numbers from 1 to 15. 


Remark 9 The 14-15 Puzzle cannot be solved, because it is an odd permu- 
tation. It only has one transposition, 14 interchanged with 15. 


proof: If we label the empty space 16, then every possible position of the 
puzzle may be regarded as an element of the symmetric group Sig and an 
element of puz(I’). There are 16! elements in S16, and 16! vertices of puz(T’). 
With the argument earlier we can show that all legal positions of the puzzle 
are obtained by an even number of transpositions. Therefore, all legal moves 
are even permutations of the puzzle. 

Now we must show that there is a certain 3-cycle in the group of the 
15-Puzzle. For example, if we shift the three pieces surrounding the empty 
space around in a circle following the order of moves south-east-north-west 
then the three-cycle (11, 12, 15) is produced, as indicated in the previous 
Figures combined with (see Fig. 6.9 and Fig 6.10) 

It can be shown that if you fix the 11 and 12 pieces, then any other piece 
can take the place of the 15 by following one of the cycles below:(see Fig. 
6.11 and Fig 6.12) 

By Lemma 160, such 3-cycles generate A,;. This proves the theorem. 

Alternatively, with some work we can show that any number can replace 
the 11 in the three-cycle, and we can show that any other number can replace 
the 12. From this we can conclude that any three-cycle can be formed. Since 
every even permutation is a combination of three-cycles (by Proposition 159), 
every even permutation of the 15-Puzzle can be reached. 


6.3. THE GRAPH OF THE 15 PUZZLE 


Figure 6.10: 
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Figure 6.11: 


Figure 6.12: 
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With this information, we can make a generalization to rectangular puz- 
zles of sizem xn with m>1andn> I: 


Theorem 121 The group of anim x n rectangular puzzle is the alternating 
group Amn—1: 


The proof of this is similar to the proof for the 4 x 4 puzzle, if m > 3 and 
n > 3. (The special cases when 1 < m < 4 or 1 < n < 4 must be treated 
separately). The size of the alternating group is given by (mn — 1)!/2. 


6.4. Remarks on applications, NP-completeness 


Cayley graphs have been used by computer scientists to model interconnec- 
tion networks for parallel processors (see [CFS], [CG] for some references). 

The problem of finding an efficient algorithm for the shortest solution to 
the m x n puzzle is difficult. It amounts to finding the shortest path between 
two points in a graph which is, in general, a difficult problem computationally 
[GJ]. 

There is a class of problems called ” NP-complete” problems. Without 
getting into precise details which would take us too far afield, this is a class 
of problems which are in some sense “equally hard” to solve. If you can find 
a “polynomial time” algorithm to solve one then you can find one to solve 
any other problem in that class as well. For example, [GJ] and [BCG] have 
a list of games and puzzles whose solutions are NP-complete problems. 
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Chapter 7 


Symmetry groups of the 
Platonic solids 


”Plato said God geometrizes continually.” 


Plutarch Convivialium disputationum, liber 8,2 


”We do not listen with the best regard to the verses of a man 
who is only a poet, nor to his problems if he is only an algebraist; 
but if a man is at once acquainted with the geometric foundation 
of things and with their festal splendor, his poetry is exact and 
his arithmetic musical.” 


Ralph Waldo Emerson 
Society and Solitude 
chapter 7, Work and Days 


This chapter requires a little more mathematical sophistication from the 
reader than the earlier chapters. However, the exercises are (I think) choosen 


to be doable. 


7.1 Descriptions 
The ”Platonic solids” are the 5 regular polyhedrons: 
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polyhedron | # faces | # vertices | # edges | group | p,q 
tetrahedron 4 4 6 T 3,3 
hexahedron 6 8 12 O 4,3 
octahedron 8 6 12 O 3,4 
dodecahedron 12 20 30 I 5,3 
icosahedron 20 12 30 I 3,5 
Here: 


p, called the face degree, denotes the number of edges bounding each face, 

q, called the vertex degree, denotes the number of faces meeting each 
vertex. 

A vertex of one of these solids is therefore specified by the q-tuple of 
faces meeting that vertex. We saw several examples of this already when 
we specified notation for the movements of the associated Rubik’s cube-like 
puzzles in chapter 5. 


These solids may be drawn in rectangular coordinates using 


polyhedron coordinates 
tetrahedron (1,1,1), (1,-1,-1), (-1,-1,1), (-1,1,-1) 
hexahedron (1,1,1), (1,1,-1), (-1,1), (-1,1,1), 
(lel-1), Cle), Cle 11), Clyl-1) 
octahedron (1,0,0), (0,0 i (0,1,0), 
(-1,0,0), (0 1,0), ( ’ rh) 
dodecahedron (0,+¢7',+4), ( o',+¢,0), 
( 0,0 ? o”'), ( L, l, 1), 
icosahedron | (1,0,¢), (1,0 o (-1,0,¢), (-1,0,—4), 
(0,¢,1), (0 by 1), (0,—¢,1), (0,-9¢,1), 
(¢,1,0), (¢,- 1 0), (—9,1, ), (—¢,-1,0) 


where @ denotes the golden ratio. 


If P,, P2, P3 are three vertices of an icosahedron which form a triangular 
face then (P,; + P,+ P3)/3 forms a vertex of the dual dodecahedron and every 
vertex of the dual dodecahedron arises in this way. 


The three ”Platonic groups” (the group of symmetries” of these figures) 
will be described below. Their names: 


T = symmetric group of the tetrahedron = tetrahedral group, 


O = symmetric group of the octahedron (or cube) = octahedral group, 


I = symmetric group of the icosahedron (or dodecahedron) = icosahedral group. 
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7.2 Background on symmetries in 3-space 


This subsection presents, with some proofs, background on isometries in 3 
dimensions necessary for understanding the symmetry groups of the Platonic 
solids. 

We fix once and for all the ”right-hand-rule” orientation in 3-space. We 
call a distance-preserving transformation in 3-space which fixes the origin a 
symmetry of 3-space. We say that such a symmetry is orientation preserving 


if it preserves the right-hand rule orientation. 


Example 122 : Lets: R°* — R° denote the function which takes each vector 
v belonging to R° and returns its reflection s(v) about the yz-plane. This is 
not orientation preserving since it reverses the direction of a counterclockwise 
moving circular path in the yz-plane. In terms of rectangular coordinates, 


s(x,y, 2) = (—2,y, 2). 


Let 


R® = {(x,y,z) | 2, y, 2 real numbers} 


denote 3-space. We also write this, when convenient, as column vectors 


xv 
R°={{ y | |a,y,z real} 
Zz 


The distance function on R?® is the function 
db, 0) = (a1 = 2)? + (1 — yo)? + (a — 


where 0, = (21, 91,21), 02 = (#2, Y2, 22). This may be expressed in terms of 
theinner product @)-02 = ¢1%2+y1y2+2122 as d(v1, v2) = WAG — B2) + (8, — vy). 
Conversely, the polarization identity: 


1 
Vp Vy = 5 (IP + %l|? + |[%, — eI I?) 


allows one to recover the value of the inner product from the knowledge of 
the values of the distance function. 
We call a function f : R? — R? an isometry if it satisfies 


d(f (v1), F(e2)) = der, v2) 
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for all vy and vg belonging to R®. 

We want to understand isometries a little better since they will preserve 
distances (and, in particular, preserve the shapes of solids) and therefore 
provide us with the kinds of symmetries of 3-space we want to consider. We 
can construct isometries using certain types of 3 x 3 matrices. (Appendix 1| 
of this chapter gives a little background on matrices. ) 


Lemma 123 If A is a3 x 3 matrix then the function A: R® > R® is an 
isometry if and only if A'* A = Is, where A' denotes the transpose identity 
matrix (obatined by flipping the entries of A about the diagonal. 


Remark 10 /n particular, if A is an isometry then det(A)? = det(A‘) det(A) = 
det(At * A) = det(J3) = 1. 


proof: The distance function is preserved if and only if the dot product 
function is preserved. (This is a consequence of the ” polarization identity” 
- see the appendix.) Let m(v,w) = v- w, where - denotes the vector dot 
product. Since m(Av, Bw) = v- (At * B)w, we have 


m(Av, Awe) = m(v, wv), Vv,w € R° 
if and only if 

B- (Abe BNB=8-w, Vo, w € R® 
if an only if At * A = Is. 


You may have been wondering how one could construct an isometry. This 
lemma gives us lots of examples. 


Example 124 A rotation matrix in 3-dimensions may be written in the form 


ry r9 sin(@) sin(w) 
R(¢,9,) = Po4 2 sin(@) cos(w) 
sin(@) sin(@) —sin(@) cos(¢@) cos(9), 


where 


ry, = cos(@) cos(w) — cos(@) sin(¢) sin(2), 

ry = sin(@) cos(%) + cos(@) cos(¢) sin(%), 
ra = — cos(¢) sin(%) — cos(@) sin(¢) cos(%), 
rq = — sin(@) sin(%) + cos(@) cos(¢) cos(%). 


and where the angles 6,0, are the “Euler angles”. This represents the 
rotation of 3-space obtained by the following sequence of rotations: rotate by 
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angle ~ about the z-axis, rotate by the angle 6 about the x-axis (0 < 8 < aw), 
then rotate by angle @ about the z-axis again. 

Although this is an interesting fact due to its explicitness, we shall not 
use this expression. 


Question: Are there any isometries which do not come from matrices as 
in the above lemma? Yes: any translation gives rise to an isometry. 

Question: Are there any examples of isometries which do not arise from 
a composition of a translation and an orthogonal matrix? No: the following 
theorem classifies all the isometries. 


Theorem 125 A function f : R°? — R® is an isometry firing the origin if 
and only if f is left multiplication by an orthogonal matrix. 


This will not be proven here (see Artin [Ar], chapter 4, section 5, Propo- 
sition 5.16). 

As a consequence of this lemma, we see that if the matrix A gives rise to as 
isometry then det(A) is either equal to 1 or -1 (since det(A)? = det(A'* A) = 
det(I3) = 1). In particular, the determinant of such a matrix is non-zero, so 
the matrix is invertible. 


Lemma 126 The set of all 3x3 matrices A such that the function A: R® > 
R®? is an isometry forms a group under matrix multiplication. 


Exercise 7.2.1 Verify the group axioms needed to prove this lemma. 


Notation: This group will be denoted O3(R) and called the orthogonal group 
of R°. We denote by SO3(R) the following subset 


SO3(IR) = {A € O3(R) | det(A) = 1}. 


which is called the special orthogonal group of R®. 


Lemma 127 SO3(R) is a subgroup of O3(R). 
Exercise 7.2.2 Verify the group axioms for SOs(R). 


It is known that the number of cosets in O3(IR)/S'O3(R) is 2. In fact, it 
is known that 


O3(R) = SO3(R) Us * SO3(R) (disjoint union) (7.1) 


where s is the reflection in the above example (this follows from [Ar], chapter 
4, section 5). 
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Lemma 128 The isometry A in O3(R) is orientation preserving if and only 
if det(A) = 1. 


We will not prove this lemma here. 


7.3 Symmetries of the tetrahedron 


Fix a tetrahedron centered at the origin, with one vertex along the z-axis. 
Each edge has an “opposite” edge on the tetrahedron (which is actually per- 
pendicular to it if you look at it straight on). Each vertex has an ”opposite” 
face. (see Fig. 7.1) 


Figure 7.1: 


There are orientation preserving symmetries (called rotations) of the 
tetrahedron and orientation reversing symmetries of the tetrahedron. The 
orientation preserving symmetries of the tetrahedron will be denoted ST. 
They are obtained as follows: 


e the 4 axes of symmetry through the centers of the faces yield 2 elements 
each (120 degree clockwise rotation when viewed from outside and a 
240 degree rotation), for a total of 8 elements, 
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(This ”tetrahedral symmetry” allows for the mechanical construction 
of the pyraminx.) 


e the 3 pairs of edges (formed by an edge and its opposite) yield one 
element each (a 180 degree rotation), for a total of 3 elements. 


These, plus the identity, give 12 elements in ST. 
Using the coset decomposition (7.1), we have T = STU s* ST (disjoint), 
so |T'| = 24. 


Remark 11 /t turns out that ST is essentially the alternating group Aq of 
even permutations of S4 and T is essentially S, itself. We shall state the 
precise result in the next chapter. 

7.4 Symmetries of the cube 

We fix a cube centered about the origin in 3-space. The set of centers of 


the faces of a cube forms a set of vertices of an octahedron drawn inside the 
cube. This octahedron is called the ”dual” polyhedron. (see Fig. 7.2) 


Figure 7.2: 
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These two polyhedra have the same symmetry group, which we denote 
by O. There are orientation preserving symmetries, or rotations, of the cube 
and orientation reversing symmetries of the cube. The orientation preserving 
symmetries of the cube will be denotes SO. They are obtained as follows: 


e the 3 axes of symmetry through the centers of the faces yield 3 elements 
each (90 degree clockwise rotation when viewed from outside, a 180 
degree rotation, and a 270 degree rotation), for a total of 9 elements, 


(This “hexahedral symmetry” allows for the mechanical construction 


of the Rubik’s cube.) 


e the 4 axes through the opposing vertices yield 2 elements each (all of 
order 3), for a total of 8 elements, 


(This ”tetrahedral symmetry” allows for the construction of the skewb 
[H].) 


e the 6 axes through the opposing mid-edge points yield 1 element each 
(of order 2), for a total of 6 elements. 


These elements, plus the identity, yield 24 elements. 


Lemma 129 There are 24 orientation preserving elements in O, ie., |SO| = 
24. 


The above sketch is one way to see why this is true. Here’s another 

proof: Let V be the set of vertices of the cube. The group SO acts on 
the set V. Fix a v belonging to V and let H = stabso(v). One can check 
that |H| = 3 (since the only symmetry which fixes v is a rotation g about 
the line through v and its opposite vertex. Since g is order 3, H =< g > is 
order 3 as well). We have |V| = 8, so by a lemma in the previous chapter 
on orbits and stabilizers, we have |SO/H| = |V|. By Lagrange’s theorem, 
\SO| = |SO/H||H| = 8-3 = 24. 

Now we know SO, what is O? Note that s, the reflection in the example 
in the previous section, belongs to O. Using the coset decomposition of the 


previous section, we have the coset decomposition 
O=SOUs*SO — (disjoint union). 


We know that |s * SO| = |SO| = 24, so 
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Lemma 130 The order of the octahedral group is |O| = 48. 


Remark 12 /t turns out that SO is essentially the symmetric group S4 and 
O is “isomorphic to the direct product” Sy x Cy. We shall state the precise 
definitions and result in the next chapter. 


7.5 Symmetries of the dodecahedron 


Figure 7.3: 


The set of centers of the faces of a dodecahedron forms a set of vertices 
of an icosahedron drawn inside. This icosahedron is called the ”dual” poly- 
hedron. We fix a dodecahedron in 3-space so that the vertices of the dual 
icosahedron are as listed in section 1 above. (see Fig 7.4) 

Let SJ denote the group of orientation preserving symmetries of the do- 
decahedron. Note SJ is a finite subgroup of SOs(R). Let J denote the 
group of all symmetries of the dodecahedron. Note J is a finite subgroup of 
O3(IR) and that SJ is a subgroup of J. Let F denote the set of faces of the 
dodecahedron, so |F'| = 12. ST acts on F. 


Lemma 131 S/ acts on F transitively. 
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Figure 7.4: 


We won’t prove this. If you look at a dodecahedron it follows ” by inspec- 
tion”. The reason why this is useful is that it tells us that if « is any face 
then any other face can be obtained from x by applying some element of SJ. 
In other words, the orbit of x is all of F: Sl*a= F. 

If x is any face then the only orientation preserving symmetries which 
don’t send x to a different face is a rotation by an integer multiple of 72 
degrees about the line passing through the center of 2 and the center of its 
opposite face. There are, for each face x, exactly 5 distinct rotations of this 
type. Therefore, 

|stabs7(x)| = 5. 


By a lemma in the section on orbits, we have 
ST/stabs;(a) = ST * x, 
so |S1| = |stabs7(x)||S7* «| = 5-12 = 60. 


The elements of SI include: 
e rotation by 2* 2 *k/5, k =0,1,2,3,4, about the line passing through 
the center of a face and its opposite, 


This ” dodecahedral symmetry” allows for the construction of the megam- 
g 
inx.) 


7.6. APPENDIX: SYMMETRIES OF THE ICOSAHEDRON AND Sg 145 


e rotation by 2* a *k/3, k = 0,1,2, about the line passing through a 
vertex and its opposite, 


e rotation by 7 about the center of an edge. 
Subgroups include: 


e stabilizer of a vertex. These are all cyclic of order 3, and they are all 
conjugate. There are 10 distinct such subgroups since a vertex and its 
opposite share the same stabilizer. 


e stabilizer of a face. These are all cyclic of order 5, and they are all 
conjugate. There are 6 distinct such subgroups since a face and its 
opposite share the same stabilizer. 


e stabilizer of an edge. These are all cyclic of order 2, and they are all 
conjugate. There are 15 distinct such subgroups. 


Exercise 7.5.1 Verify all these. 


Remark 13 /¢ turns out that STI is essentially the alternating group As of 
even permutations of Ss and I is “isomorphic to the direct product” As x Ss. 
We shall state the precise result in the next chapter. 


For an excellent discussion of the symmetries of the icosahedron, see [Ba]. 


7.6 Appendix: Symmetries of the icosahe- 
dron and 5¢ 


A duad is a pair of diagonals (a diagonal is a segment from a vertex to its 
antipodal opposite vertex) of the icosahedron. The top of the icosahedron 
has 6 vertices and each diagonal must have exactly one of these 6 vertices as 
an endpoint. There are 12 vertices, hence 6 diagonals, hence 


(s)= 


different duads. Each duad determines a ” golden rectangle” (i.e., a rectangle 


whose ratio length/width is either the golden ratio 6 = (1 + V5)/2 or its 
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icosahedron with duad 


Figure 7.5: 


inverse. We may identify a duad with a pair of distinct integers {(7,7) | 1 < 
i<j <6}, ie., with a 2-cycle in Sg. A duad may be pictured as (see Fig. 
7.5) 

Each element of the rotation group of the icosahedron (i.e., the group 
of orientation-preserving symmetries of the icosahedron) must send a duad 
to a duad. Each duad has 4-fold symmetry, i.e., can be send to itself in 4 
ways. There are 15 duads, so there are 4-15 = 60 ways to send a duad to 
another. This is precisely the number of orientation-preserving symmetries 
of the icosahedron. 

The 18-th century mathematician Sylvester (who, though from Great 
Britain, once taught in Maryland) called a partitioning 


X= X,U...UX, — (disjoint), 


of a set X a syntheme if each of the sets X;,1 <7 <n, has the same number 
of elements. If we take 


X = {set of diagonals of the icosahedron} 


then a syntheme is a set of three duads, no two having a diagonal in common. 


There are 
6 4 
_ 
(3)()a 15 
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different synthemes (the 3! since there are 3! ways to permute the duads 
amongst themselves). A syntheme may be represented by a coloring of the 
vertices on the top of the icosahedron using three colors, each for exactly two 
vertices. We may identify a syntheme with a product of 3 distinct 2-cycles 
in Sg. A syntheme may be pictured as (see Fig. 7.6) 


icosahedron with true cross syntheme 


Figure 7.6: 


We partition the set of 15 duads into 5 groups of 3 as follows. (Recall 
each syntheme is a triple of duads.) First, pick a syntheme, A;. Pick another 
syntheme A», so that A,,A2 have no duad in common. Continue on in this 
way until you pick five synthemes Aj,..,As5, no two of which have a duad 
in common. Such a choice of 5 synthemes is called a pentad. There a 6 
pentads, which we label P,,..., Pg in any way you like. (A list of the six 
pentads is given in [R], chapter 7.) Any permutation of the 6 diagonals of 
the icosahedron gives rise to a permutation of the set of 6 pentads. Hence any 
permutation of the 6 diagonals of the icosahedron, which may be regarded as 
an element of Sg, gives rise to a permutation of the set of 6 pentads, which 
may also be regarded as an element of Sg. This gives a map 


fi Se — Se. 


This example will be discussed further in the next chapter. See also [R] and 
[Bal]. 
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Chapter 8 


Groups, II 


” The art of doing mathematics consists in finding that special 
case which contains all the germs of generality.” 


David Hilbert 


Groups are analogous to molecules. We (mathematicians) want to know 
what they “look like”, we want to know how to describe them, how to com- 
pare them, how to “make” more of them, if they fall into ”families” with 
similar properties... 

Given two groups G';, Go, a natural question is to ask how ”similar” are 
they? (Exactly what is meant by ”similar” will be explained later.) We shall, 
in this chapter, introduce notions and techniques useful for comparing two 
groups. In a later chapter, we will focus on the 3 x 3 Rubik’s cube group by 
comparing it to “better understood” groups. 


8.1 Homomorphisms 


A homomorphism between two groups is, roughly speaking, a function be- 
tween them which preserves the (respective) group operations. 


Definition 132 Let G,,G» be groups, with *, denoting the group operation 
for Gy and *2 the group operation for G2. A function f : Gy — G2 ts a 
homomorphism if and only if, for all a,b © Gy, we have 


F(a #1 b) = f(a) *2 f (6). 
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Exercise 8.1.1 Prove the following: If f :G, — Gz is a homomorphism of 
groups then 


f(Gi) = {9g € Go | g = f(x), for some x € G,} 


is a subgroup of Go. 


The subgroup f(G1) < G» is called the image of f and is sometimes 
denoted im(f). 


Example 133 Let G be a group and h a fixed element of G. Define f :G— 
G by 


f(gh=hhegeh, g eG. 
Then the following simple trick 


flab) =hol*(axb)eha=h | *xaxheh «beh = f(a) * f(b) 


shows that f is a homomorphism. In this case, im(f) = G, i.e., f is surjec- 
tive. 


Exercise 8.1.2 Let 
0 1 0 1 0 0 
A= 100 4], B=[{0 01 
001 0 1 0 


Now, let G=< A,B > denote the group of all matrices which can be written 
as any arbitrary product of these two matrices (in any order and with as 
many terms as you want). We have 


no ao) 
oO 
seo oLYfnme, 


i 
0 
0 
i 
0 
0 


or @& 
CoroFell™ 
oCorere 


0 
0 
1 
0 
, 0 
}(: 
(You may want to try to check this as an exercise by regarding each such 
matrix as a permutation matrix.) Define f :G— S3 by 


ee" OH OO 
eo) 
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g L(g) 
Ts 1 
A 81 
B 89 
AxB 81 * 82 
BxeA 82 * 84 
Ax Bx A | 81 * 82 * 81 


Show that this is a homomorphism. 
Example 134 The function 
sgn: S, 3 {+1}, 


which assigns to each permutation its sign, is a homomorphism. One reason 
why this is true is because the sign of a permutation g is the determinant of the 
associated permutation matrix P(g). Since the determinant of the product is 
the product of the determinants (this is a basic result in linear algebra [JN]), 
we have sgn(gh) = det P(gh) = det P(g) det P(h) = sgn(g)sgn(h), for all 
g,h € S,. From this it follows that sgn is a homomorphism. 
Lemma 135 If f : Gy — G2 is a homomorphism then 

(a) f(e1) = €2, where e, denotes the identity element of Gy and e2 denotes 
the identity element of Go, 

(b) f(x") = f(x)", for all x belonging to Gi, 

(c) fly" #1 2%, y) = fly) 2 f(a) *2 fly), for all x,y belonging to Gi, 
(*, denoting the group operation for G, and *, the group operation for Gy). 

proof: (a) We have f(x) = f(a *, e1) = f(x) *2 f(er), for any « € Gi. 
Multiply both sides of this equation on the left by f(a)7t. 

(b) We have, by part (a), eg = f(e1) = f(x #1 27') = f(x) 2 f(x"). 
Multiply both sides of this equation on the left by f(a)7t. 


Exercise 8.1.3 Prove part (c). 


Definition 136 Let G,,G_ be finite groups. We say that G, embeds (or 
injects) into Gy if there exists an injective homomorphism f : Gy > Go. 
A homomorphism f : Gy, — G2 is a isomorphism if it is a bijection (as 
a function between sets). In this case, we call Gy and G2 isomorphic and 


write Gy = Gg. An isomorphism from a group G to itself is called an 
automorphism. 
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The notion of an isomorphism is the notion we will use when we want 
to same two groups are “essentially the same group”, i.e., one is basically a 
carbon copy of the other with the elements relabeled. 


8.2 Homomorphisms arising from group ac- 
tions 


Lemma 137 Let G be a group and X a finite set. If G acts on X (on 
the left, resp. on the right) then there is a homomorphism G — Sx given by 
gt— @,. Conversely, if ¢:G— Sy is a homomorphism then o(g): X + X 
defines a (left, resp. right) action of G on X. 


Exercise 8.2.1 Prove this. 


Example 138 Let G be the Rubik’s cube group generated by the basic moves 
R,L,U,D,F,B. For each move g € G, let p(g) be the corresponding permu- 
tation of the set of vertices V of the cube and let o(g) be the corresponding 
permutation of the set of edges E of the cube. Let S,, denote the symmetric 
group on n letters and identify Sy with Ss, Sg with Sy. Then 

(a) p: G— Ss is a homomorphism, 

(b) 0: G— Sq is a homomorphism. 


Exercise 8.2.2 Prove this. 


Definition 139 Let G act on a set X. We call the action k-tuply transitive 
if for each pair of ordered k-tuples (a1, %2,..,©x), (Y1,Y2,+ YK) of elements 
belonging to X there is ag €G such that y; = ¢,(2;) for each 1 <i<k. 


Exercise 8.2.3 Is the Rubik’s cube group 2-transitive on the set of edge 
facets? 


The following result is one illustration of how unique the symmetric group 
and alternating group are. (Recall that the alternating group A, was defined 
in example 82 above.) 


Theorem 140 /fk > 5 and G is a group acting k-transitively on a finite set 
X then G is isomorphic to S,, or to A,, for somem>k or somen > k+2. 

Conversely, S, acts n-transitively on {1,2,...,n} and A, acts (n — 2)- 
transitively on {1,2,...,n}. 
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This is proven in [R]. 


Remark 14 We shall discuss an example of this in 8813.6, 13.7. We shall 
also, in conjunction with our group-theoretical determination of the Rubik’s 
cube group proven later, be able to deduce from Theorem 140 the following 


Corollary 141 (a) The Rubik’s cube group G acts 6-transitively on the cor- 
ners, leaving the edges alone. It acts 8-transitively on the corners but may 
permute two edges. 

(b) The Rubik’s cube group G acts 10-transitively on the edges, leaving 
the corners alone. It acts 12-transitively on the edges but may permute two 
corners. 


8.3 Examples of isomorphisms 


Example 142 Let G be the group in Exercise 8.1.2 and f : G — $3 the 
homomorphism. This is in fact an isomorphism. 


Example 143 Let H be the subgroup of the Rubik’s cube group generated by 
the basic move R: H =< R>. Then H = C4 (where C4 denotes the cyclic 
group of order 4). 


Example 144 Recall that to cach permutation g of the set {1,2,...,n} we 
can associate an Xn permutation matrix P(g) in such a way that 


al Vg(1) 

v9 x (2) 
P(g)| 2 |=] % 

qn Lg(n) 


Here the image of t under the permutation g is denoted g(t), though in fact 
one plugs i into g from the left.) We let P, denote the set of alln xn 
permutation matrices. This is a group under matrix multiplication. The 
function 
P:S, 3 P,, 
g > P(g) 


is an isomorphism. The proof that this is a bijection and a homomorphism 
was given earlier in the chapter on permutations. 
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Example 145 From [NST], we have the following table of isomorphisms: 


name notation isomorphic to 
symmetry group of tetrahedron T S4 
rotation group of tetrahedron ST A4 
symmetry group of octahedron O S4 xX Co 
rotation group of octahedron SO S4 
symmetry group of icosahedron I As X C2 
rotation group of icosahedron ST As 
symmetry group of regular n-gon Do, n odd 
D, X Co, n even 
rotation group of regular n-gon D,, 


Example 146 This example may be found in [B]. 
Let Q denote the quaternion group: 


Q= (1, 1,2 2,3 jek, k}, 


where 7? = j? =k? = —-1,147 = k,gk = 2, ki = j, and in general, xy = —yx 


for x,y belonging to 1,j,k. Then Q is isomorphic to the group 


Qe =< a,b>< G, 
where 
a= F? * Mp*U-'* My! *U7! * Max U * My! *U * F?, 
b= BF «U2 «FN eU te bt B12 * Be UL, 


via the map f : Q* > Q defined by ¢(a) =2, o(b) =k. 
The proof of this claim will be formulated in a later chapter as an exercise. 
(The easiest way to prove this uses ideas we haven't yet introduced. ) 


Exercise 8.3.1 As above, let G be a group and h a fixed element of G. 
Define c,:G—>G by 


en(g) =heg*h", Gg EG. 
Show that this is an automorphism. 


Definition 147 An automorphism as in the above exercise is called inner. 
An automorphism of G which is not of this form, for some h € G, is called 
outer. 
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(solution to Exercise 8.3.1: To verify this, we must show that f = c;, is an 
injective and surjective homomorphism (we drop the subscript for simplicity 
of notation). 

First, we show that f is injective. Suppose f(gi) = f(g2). Then f(g * 
g,') =1, so that h*g, * gy! *h-+=1. Multiply both sides of this equation 
on the left by A and on the right by h~!. We obtain g;*g;' = 1. This implies 
91 = J, so f is injective. 

Now we show f is surjective. Let g be an arbitrary but fixed element of 


G. Let y= h-t*a*h. Then 
fy) =f(ht*xa*xh)=hehlt*xrxh*eh =z. 


Therefore, f is surjective. 

We verified previously that f is a homomorphism.) 

Notation: The set of all automorphisms of a group G' is denoted Aut(G). 
The subset of inner automorphisms is denoted 


Inn(G) := {f € Aut(G) | f =cn, some h € G}, 
in the notation of the above 8.3.1. 


Exercise 8.3.2 (a) Show Aut(G) is a group with composition as the group 
operation. 

(b) Show that Inn(G) is a subgroup of Aut(G). 

(c) Show that the function 6: G — Inn(G) defined by o(h) = ch is a 


homomorphism. 


8.3.1 Conjugation in S,, 


The following result will be of importance to us in a later chapter: 


Lemma 148 Suppose f : S, — S, is an inner automorphism. If g € S,, is 
a disjoint product of cycles of length ky,...,k, then f(g) is a disjoint product 
of cycles of length ky,..., ky. 


In other words, an inner automorphism (i.e., conjugation) must ” preserve 
the cycle structure”. Lemma 50 , whose proof was promised earlier, follows 
from this. 

proof: Since f is inner, let it be conjugation by h € S, say, so f(g) = 
h-'gh, for all g € S,. Let (2)g € {1,...,n} denote the image of 2 € {1,...,n} 
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under g € S,,. The lemma is a consequence of the following simple calcula- 
tion: if (¢)g = 97 then, for all 1 <7 <n, we have 


((i)h)(h-'gh) = (j)h. (8.1) 


In other words, if g sends 7 +> j then h~'gh sends (2)h +> (j)h. It follows 
that g and h~'gh have the cycle structure. 


Theorem 149 Two elements g,g' € S, are conjugate if and only if they 
have the same cycle structure. 


proof: The Lemma proves the ”only if” direction of this equivalence. 
Suppose that g,g’ € S, have the same cycle structure. Write their dis- 
joint cycle decompositions using the lexicographical ordering imposed on the 
lengths of the cycles occurring in the decomposition: say 


= (ty eee dng ony 41 eee Ung )eee(bng $1 eee dn)s 


Me Ua Varga ce ng dong pn oe Uads 


g 
g =(t.. 


where 1 < ny <... <n, <n. Pick an h € S, such that h si; -=> a, for all 
1<j<n. Then g’ = ho'gh, by (8.1). 


8.3.2 Aside: Automorphisms of S,, 


Though we shall not need it here, the following fact is interesting since it 
illustrates what a unique role the symmetric group 5S¢ plays in the family of 
all symmetric groups. 


Theorem 150 Jf n 4 2,6 then the homomorphism @: S, — Aut(S,) (in 
part (c) of the exercises above) is an isomorphism: 


S, = Aut(S,). 


Ifn =6 then |Aut(S¢)| = 2 - |S]. 


Continuing our example from the appendix to the above chapter on the 
Platonic solids: 
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Example 151 Any permutation of the 6 diagonals of the icosahedron, which 
may be regarded as an element of Sg, gives rise to a permutation of the set 
of 6 pentads, which may also be regarded as an element of Sg. This gives a 
map 


f 256 Se, 


which is in fact a homomorphism. This homomorphism is injective so it is 
actually an automorphism. 

However, a 2-cycle on the set of 6 diagonals (i.e., swapping exactly 2 di- 
agonals) does not induce a 2-cycle on the set of these 6 pentads. In fact, a 
2-cycle on the set of diagonals gives rise to a product of three disjoint 2-cycles 
on the set of these 6 pentads. Therefore, by the above theorem (which says 
that an inner automorphism must preserve the cycle structure) this automor- 
phism f cannot be an inner automorphism. 


8.4 Kernels and normal subgroups 


Let f : Gy — G2 be a homomorphism between two groups. Let 


ker(f) ={9 © G1 | f(g) = eg}, 


where €2 is the identity element of G2. This set is called the kernel of f. 
Lemma 152 ker(f) is a subgroup of Gy. 
Exercise 8.4.1 Prove this. 


Example 153 Let 
sgn: S, > {+1} 


denote the homomorphism which associates to a permutation either 1, if it 
is even, or -1, if it is odd. Then A, = ker(sgn) C Sy. 


The following properties of the kernel are useful: 


Lemma 154 : Let f : Gy > G2 be a homomorphism between two groups. 
(a) f is injective if and only if ker(f) = {er}. 
(b) if g belongs to the kernel and x is any element of G then x~!*g *x 
must also belong to the kernel. 
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proof: (a) f is injective if and only if f(gi1) = f(g2) implies g, = ge 
(91,92 € Gy). Note f(g1) = f(g2) is true if and only if f(g, * gy!) = eo. If 
ker(f) = {e,} then f(g: * gy!) = e2 implies gy * gy’ = e2, which implies 
91 = g2, which implies f is injective. 

Therefore, if ker(f) = {e2} then f is injective. Conversely, if f is injective 
then f(x) = f(e1)(= e2) implies x = e,(a € G). This implies ker(f) = {e1}. 

(b) Multiply both sides of eg = f(g) on the left by f(x)~' and on the 
right by f(a). We get 


e2 = f(a)! ¥en * f(a) = f(a") * f(g) * fe) = f(a *g * x), 


as desired. 


Definition 155 Let H be a subgroup of G. We say that H is a normal 
subgroup if, for eachg € G, g-'* H*xg =H (i.e., for each g € G and each 
he H,g'*xhx*q belongs to H). 


Notation: Sometimes we denote ”H is a normal subgroup of G” by 
H«aG 
Example 156 A,,<S,, and |A,| = $|S,|- 
We have already shown the following 


Lemma 157 If f : Gy — G2 is a homomorphism between two groups then 
ker(f) is a normal subgroup of G. 


Remark 15 The following remarkable result about the alternating group will 
not be needed in this course, except as an example of a group with no normal 
subgroups provided for the readers’ cultural benefit. It will not be proven here. 
(For a proof, see for ecample [R].) It is - believe it or not - connected with the 
fact that you cannot solve the general polynomial of degree 5 or higher using 
radicals, t.e., that there is no analog of the quadratic formula for polynomials 
of degree 5 or higher. 


Theorem 158 If X has 5 elements or greater then Ax has no non-trivial 
proper normal subgroups. In other words, if H < Ax is a normal subgroup 


then either H = {1} or H= Ax. 
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The next fact about the alternating group will be needed later in our 
determination of the structure of the Rubik’s cube group. This fact also 
arose in connection with our previous discussion of the ”legal positions” of 
the 15 puzzle. 


Proposition 159 Let H be the subgroup of S,, generated by all the 3-cycles 
in S,, then H = Ay. 


proof: Since sgn : S, — {+1} isa homomorphism, and since any 3-cycle 
is even, any product of 3-cycles must also be even. Therefore, H C A,. If 
g € A, then g must swap an even number of the inequalities 1 << 2 <.4.< 
n—1 <n, by Definition 37. Therefore, (since any permuation may be written 
as a product of 2-cycles, Theorem 58) g must be composed of permutations 
of the form (¢ j)(k 2) or (@@ 7) &). But @ g)\(k 1) = (ij ky & I) and 
(i j)(9 k) = (09 k&). Therefore, g € H. This implies A, C H,so A, = H. 


The following more precise result is very useful for the purposes of the 


analsis of permutation puzzles. 


Lemma 160 (/W/) Let X be a finite set, |X| > 3 and fix u,v as elements 
in X. Then the 3-cycle (u,v, x), « an element of X — {u,v}, generates Ax. 
This lemma proves an even stronger statement than the previous claim. Now, 
instead of only one element being fixed, there are two elements fixed and the 
group is still generated. 


8.5 Quotient subgroups 


One of the most useful facts about normal subgroups is the following: 


Lemma 161 /f H is a normal subgroup of G then G/H is a group with the 
following operation: 


aH *bH =(ab)H, (aH) '=a"'H, 


for all a,b belonging to G. The identity element of this group is the trivial 
coset H, 


Exercise 8.5.1 Prove this. 
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This group G/H is called the quotient group of G' by H and is sometimes 
pronounced ”G mod H”. 


Example 162 /f f :G, — G2 is a homomorphism between two groups then 
Gi/ker(f) is a group. 


The” basic building blocks” of the collection of finite groups are the groups 
which have no non-trivial quotient groups. Intuitively, this is because a non- 
trivial quotient group is closely related to the original group but smaller in 
size (and hence perhaps subject to analysis by an inductive argument of some 
type). These ”basic” groups are called ” simple”: 


Definition 163 A simple group is a group with no proper normal subgroups 
other than the trivial subgroup {1}. 


Example 164 [fp is a prime then C, (the cyclic group having p elements) 
is simple. In fact, if G is any group which is both abelian and simple then 
there is a prime p such that G = C,. Ifn > 4 then A, is simple (as was 
stated above in Theorem 158). These facts are proven in [R]. 

Simple groups are not very abundant. In fact, the first non-abelian simple 
group is of order 60 (it’s As). 


The following basic result describes the quotient group Gy/ker(f). 


Theorem 165 ("first isomorphism theorem”) If f : Gy 3 G2 is a homo- 
morphism between two groups then G,/ker(f) is isomorphic to f(G,). 


proof: ker(f) is a normal subgroup of G,, so G,/ker(f) is a group. 
We must show that this group is isomorphic to the group f(G1). Define 
&: Gi/ker(f) > f(G2) by o(g- ker(f)) = f(g), for g € G1. We must show 

(a) @ is well-defined, 

(b) ¢ is a homomorphism, 

(c) disa bijection. 

If g- ker(f) = g'- ker(f) then g7'g’ € ker(f), since ker(f) is a group. 
This implies f(g7'g’) € f(ker(f)) = {1}, so f(g) = f(g’). This implies ¢ is 
well-defined. 

Since ker(f) is normal, (g-ker(f))(g-ker(f)) = ga'(g" ker(f)g')ker(f) = 
gg!-ker(f). Therefore ((g-ker(f)\(g"-ker(i))) = olag!-ker(A)) = F(ga!) = 
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(DAG) = o(g- ker(f))o(g'- ker(f)), for all g,g' € G. This implies ¢ is a 
homomorphism. 

It is clear that @ is surjective. To show that @ is a bijection, it suffices 
to prove ¢ is an injection. Suppose that o(g- ker(f)) = o(g'- ker(f)), for 
some g,g’ € G. Then f(g) = f(g’), so f(g7'g’) = 1. By definition of the 
kernel, this implies g~'g’ € ker(f), so g- ker(f) = g'- ker(f). This implies 
@ is injective. 


The other isomomorphism theorems will not be needed but will be stated 
to help to illustrate the usefulness of the notion of normality: 


Theorem 166 (second isomorphism theorem”) If H, N are subgroups of a 
group G and if N is normal then 

(a) HAN is normal in H, 

(b) there is an isomorphism 


H/(HON)& NHN. 


Theorem 167 (third isomorphism theorem”) If Ny, No are subgroups of a 
group G, if Ny C No, and if Ny and No are both normal then 
(a) No/N, is normal in G/N,, 


(b) there is an isomorphism between 
(G/N1)/(N2/M) = G/N2. 
We shall not prove these results here - see [G] or [R]. 


8.6 Direct products 


Definition 168 Let H,, H, be two subgroups. We say that a group G is the 
direct product of H, with Hz, written 


G= HM, x Ho, 
if 
(a) G= Hy x Hy (Cartesian product, as sets), 
(b) the group operation on G is given ”coordinate-wise” (still denoted ”*” 
for simplicity): 
(x1, y1) * (2, Y2) = (@1 * 22,1 * Ya), 


for %1,%2 © Ai, y1,y2 € He (where * denotes multiplication in Hy, Hz, and 
G). 
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Example 169 Let G be (as a set) the Cartesian product G = C3, where 
X? = X x X and where Cy denotes the cyclic group of order 2 (with addition 
mod 2 as the operation). Define addition on G coordinate-wise: 


(my, 1) + (mg, nr2) = (My + Mg, r1 + N29), 


where0 <m;, <1,0<n; <2, for? =1,2, 7 = 1,2. 


Example 170 The symmetry group O of the octahedron is isomorphic to 
S4x Co. The symmetry group I of the icosahedron is isomorphic to As x Cy. 
(This is not isomorphic to Ss, despite the fact that they both have the same 
number of elements and they both contain As as a normal subgroup.) 


8.7 Examples 


Example 171 Consider the subgroup H of the Rubik’s cube group generated 
by the “square slice moves”, 


H =< Mp, Mp, ME > .- 
Then H =< MR > x < MA> x < MZ > Cy x Cy x Ch = CF. 


8.7.1 The twists and flips of the Rubik’s cube 

We recall some notation: 
e X is the set of 48 facets of the Rubik’s cube which are not center facets, 
e V denotes the subset of facets which belong to some corner subcube, 
e F is the subset of facets which belong to some edge subcube. 
e Let G denote the Rubik’s cube group. 


e Let F be the group generated by all the moves of the Rubik’s cube 
group which do not permute any corner or edge subcubes but may 
twist or flip them. 


e Let Sx,Sy, Sz, denote the symmetric group on X,V, £, respectively. 
We may regard FG, as subgroups of Sx. We may also regard Sy, Sz 
as subgroups of Sy (for example, Sy is the subgroup of Sx which leaves 
all the elements of FE fixed). 
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e Let 


Gy =SvNG, Gp=SeNG, W=SyNF, Pp=SpNF. 


Note that the action of G on X induces an equivalence relation as follows: 
we say that a facet f, is equivalent to a facet f, if there is a move of the 
Rubik’s cube which sends one facet to the other. There are exactly two 
equivalence classes, or orbits, of G in X: namely, V and EF. In particular, 
the action of G' on V is transitive and the action of G on F is transitive. 
On the other hand, F' leaves each vertex (resp., edge) fixed, though it may 
permute the corner facets (resp., edge facets) associated to a vertex (resp., 


edge). 


Exercise 8.7.1 Show that: 

(a) The set V of equivalence classes of F acting on V is in bijective 
correspondence with the set of all vertices of the cube. 

(b) The set E of equivalence classes of F acting on E is in bijective 
correspondence with the set of all edges of the cube. 


The interesting thing is that we have 
F=f, x Fp, (direct product). 


Exercise 8.7.2 Notation as above. 
(a) Show F = Fy x Fr (direct product). 
(b) Is Sy = Sy x Sg (direct product)? 


A harder question, which we will answer later in the negative: Is G = 
Gy x Gg (direct product)? 


8.7.2 The slice group of the Rubik’s cube 


The material below can also be found in [BH]. Some ideas are also discussed 
in [Si]. 

Let H be the group < Mpr,Mpr, My > generated by the middle slice 
moves. This group is called the ”slice group”. Let FE be the set of edges of 
the cube (which we identify with the set of edge subcubes), let C’ be the set 
of center facets of the cube, and let X = FE UC. AH acts on X. Note that H 
does not affect the corners (i.e, the vertices of the cube). 


164 CHAPTER 8 GROUPS, II 


Questions: (a) Is the action of H on X transitive? 

(b) Is the action of H on C transitive? 

(c) Is the action of H on E transitive? 

The answer to (a) is no - an edge subcube cannot be sent to a center 
facet, for example, so there is an element of X which cannot be sent to any 
other element of X by an element of H. The answer to (b) is yes - any center 
facet can be sent to any other center facet by an element of H. The answer 
to (c) is no - for example, the uf edge subcube cannot be sent to the ur edge 
subcube by a slice move, so there is an element of F which cannot be sent 
to any other element of F by an element of H. 

The answer of ”no” to (c) brings about the following 

Question: What are the orbits of H on E? 

The answer may be phrased in various ways, but let us look at it in the 
following way: suppose we call two edge subcubes equivalent if one can be 
sent to the other by a slice move (i.e., an element of H). There are 3 disjoint 
equivalence classes: all the subcubes in the middle RL-slice are equivalent, 
all the subcubes in the middle FB-slice are equivalent, and all the subcubes 
in the middle UD-slice are equivalent. The distinct orbits of H acting on E 
are the following: 


e the middle RL-slice, denoted by Fry, 
e the middle FB-slice, denoted by Erg, 
e the middle UD-slice, denoted by Eup. 


Note that 
EL = Ep, U Erp U Eup, 


is a partitioning of F into the distinct equivalence classes defined by the 
action of H on EF. 
Each element of H determines an element in Sy. We have a homomor- 
phism 
f:H—Sx 


This is another way of saying that H acts on the set X, which we already 
know. Note that each basic slice move M (so M is either Mp, Mr, or My) 
is, as an element of Sx, of the following form: 


M = (4—cycle in Sg)(4 — cycle in Sc). 
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Conversely, does an element of Sy uniquely determine an element of H? In 
other words, is f injective (i.e., an embedding)? 

To answer this, fix an h € H and think about what f(h) tells us: f(h) 
tells us which each subcube moves to which other subcube but it doesn’t tell 
us, for example, how a subcube is flipped or rotated. 

The fundamental theorem 60 inspires the following question: 

Question: Can an element of H flip, but not permute, an edge subcube 
(and possibly permuting or flipping other subcubes of the cube)? 

The answer is no. The reason why is that the slice moves can only rotate 
a given edge subcube within the slice it belongs to. 

It follows, therefore, that the permutations of the edge subcube and cen- 
ters determine a unique element of the slice group. In other words, we have 
proven the following 


Proposition 172 The homomorphism 
f:H— Sx 
is an embedding. 
Remark 16 The analog of this for the Rubik’s cube group is false! 
HT acts on the set Epp, so we have a homomorphism 
rrp: H > Se, 


and similarly, ryp : H — Se,,, rrp: H - Sepp. 
HT acts on each of the sets EF and C’, so we have homomorphisms 


r=rrtXrup X trp: > Spy, X Seyp X Sepp C SE; s:H So, 
which we can put together to obtain an injective homomorphism 
rxs:H—> Sp, X Sepp X Skyy X Sc 


To determine H, we determine the image of H in Sz, X Sap, X Sepp X Sc- 
To do this, we first look at the image of H in each of Sz,,, Szp,, and 
SEyp- This is easy enough: 


e the image of H in Sp,, is < Mp >= C4, 
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e the image of H in Sp,, is << Mp >= C4, 
e the image of H in Sz, is < My >= C4. 


Later, we shall want to think of Cy as {0,1,2,3}, with addition mod 4, and 
the image of an element A € H under one of the homomorphisms above, 
rap: H — Sz, say, as an integer 0 < rpap(h) <3. 

Next, we must determine the image of H in Sc. This is easy if it’s looked 
at in the right way. As far as the movements of the center facets is concerned, 
the slice moves may be replaced by their corresponding rotations of the entire 
cube about an axis of symmetry. In this case, we see that the image of H in 
Sc is the same as the image of the orientation-preserving symmetry group 
of the cube! This we know, by the discussion in chapter 7 and Example 145 
above, is isomorphic to $4. 

Putting all this together, we see that the image of H in Sz,, X Szpy X 
Sup X Sc is isomorphic to a subgroup of 


C? x S4. 


We may represent the elements of H , therefore, as 4-tuples (hy, ha, hs, ha), 
with hy, ho,h3 € Cy and hy € Sy. Since each of the generating moves of H 
(namely, Mr, Mu, and Mr) satisfies 


sgn(r(h)) = sgn(s(h)), 
for all h € H, the image of H cannot be all of C}? x S4. 


Proposition 173 The image of H in C}? x S4. is isomorphic to the kernel 
of the map 


t: CF x Sy {+1} 
(hi, ha, hg, ha) > sgn(hi) - sgn(ha)- sgn(h3) + sgn(ha), 


where each sgn is the sign of the permutation, regarded as an element of Sx. 
Exercise 8.7.3 Show that |ker(t)| = (4° -4!)/2 = 768. 


proof: We have shown that H is isomorphic to a subgroup of C} x S4. 
In fact, we know that the basic slice moves Mp, Mu, Mr (which generate H) 
all belong to the kernel of ¢, so H is isomorphic to a subgroup of ker(t) < 
Ce Xx S4. 
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It remains to show that every element in ker(t) belongs to H. To do this, 
we consider the projection homomorphism 


p: HS 


obtained by composing the homomorphism r x s: H — C} x S4 constructed 
above with the projection homomorphism C} x S4 — S4. We have shown 
that p is surjective. Our next objective is to compute the kernel of p and use 
the first isomorphism theorem to determine H. 

Claim: The kernel of p is 


ker(p) = {h € H | s(h) =1, rrac(h) +rup(h) + rrp(h) = 0 (mod 2)}. 


Note that ker(p) is a subgroup of A so the sign of the permutation s(h) 
is equal to the sign of the permutation r(h): 


sgn(s(h)) = sgn(r(h)) = sgn(rrac(h))- sgn(rup(h))- sgn(rrp(h)). 
This implies that 
ker(p) C {h € A | s(h) = 1, rec(h) +rup(h) + rrp(h) = 0 (mod 2)}. 


Conversely, pick an h € H such that s(h) = 1 and rpar(h)trup(h)t+rre(h) = 
0 (mod 2). We may represent this element h as a 4-tuple (m1, n2, nz, 8), with 
0 <ny,ng,n3 <3 ands=1€ Sq. 

For example, 


e the element M, = Mp * Mj' * Mp * Mr is represented by the 4-tuple 
(1,1,0,1), 


e the element My = Mp *« Mp * Mp * M;' is represented by the 4-tuple 
(0,1,1,1), 


e the element M3 = Mp * Mp * Mp! * M;' * Mr M5 ' * Mp Mp is 
represented by the 4-tuple (0,0, 2, 1). 


These elements generate all the elements of the group 
{(a,b,¢e,1) | a,b,c € Cy, a+ b6+¢=0 (mod 2)}. 


Note that the group {(a, b,c) | a,b,c € Cy, a+b+c¢ = 0 (mod 2)} is, in turn, 
the kernel of the map Ci} — C2 given by (a,b,c) —>a+6+c=0 (mod 2). 
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Exercise 8.7.4 Show that |{(a,6,c) | a,b,c € Cy, a+6+c¢ = 0 (mod 2)}| = 
43/2 = 32. 


Therefore, the element h choosen above must be expressible as a ” word” 
in these three elements M,, Mz, M3. This shows that 


{h € H | s(h) =1, rrec(h) +rup(h) +rre(h) = 0 (mod 2)} C ker(p), 


which implies the claim. 

To summarize what we have so far: we have a surjective homomorphism 
p: H — Sq with kernel {h € H | s(h) = 1, rez(h) +rup(h) +rrp(h) = 
0 (mod 2)}. The kernel has |ker(p)| = 32 elements and the image has 
l¢rm(p)| = |S4] = 4! = 24 elements. Since, by the first isomomorphism 
theorem, 


H/ker(p) = Sq, 


we have |H| = 32-24 = 768. But the kernel of the homomorphism ¢ : 
C} x S4 — {+1}, which we know contains H as a subgroup, also has 768 
elements. This forces h = ker(t). 


The slice group of the megaminx 


The subgroup S' of the megaminx group generated by all elements of the form 
| where x,y correspond to faces which have no intesection is called the 
slice group of the megaminx. 

Problem (Longridge): Determine S. 


LKY 


8.8 Semi-direct products 


If a group G' contains two subgroups H, and H2, with H; dG normal, such 
that each element of G can be written uniquely as a product Ayh2, with 
hy € Hy and hz € Hy then we say that G is the semi-direct product of 
Hy, and H». In this situation, H2 is called a complement of H,. In this 
section, we shall define two more ways of seeing how a semi-direct product 
can be expressed. Later, we shall see that the Rubik’s cube group is an easily 
described subgroup of a certain semi-direct product. 


Definition 174 Now suppose that H,, Hz are both subgroups of a group G. 
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We say that G is the semi-direct product of H, by Ho, written 


GH, >< He 


if 
(a) G = Hy, * Aa, 
(b) Hy and Hz only have 1, the identity of G, in common, 
(c) Hy is normal in G. 


This is the “internal version” of the semi-direct product. 


Of course, if we define anything using two apparently different definitions, 
we'd better be sure that they are equivalent! The fact that they are is a 
theorem (Theorem 7.23 in [R]) which we won’t prove here. 

Note that the multiplication rule in G doesn’t have to be mentioned since 
we are assuming here that G is given. 

The ”external version” is defined by a construction as follows: 


Definition 175 Assume we have a homomorphism 
@: Hy > Aut(M,). 
Define multiplication on the set Hy, x Hy by 


(21,41) * (Xo, y2) = (24 * o(y1)(%2), yi * yo). 


This defines a group operation. This group, denoted H, ><gHo, is the (ex- 
ternal) semi-direct product. 


This definition will be used with the example of H; = C7” and Hy = S, 
in the next chapter. 

These last two definitions are equivalent by Theorems 7.22-7.23 in [R]. 

As a set, H, >< Hy, is simply the Cartesian product H, x Ab. 


Example 176 Let R? denote the direct product of the additive group of real 
numbers with itself: 


R? = {(2,y) | ey real}, 


with the group operation being addition performed componentwise. Let C2 
denote the multiplicative cyclic group with 2 elements, whose elements we 
write (somwhat abstractly) as Cy = {1,5}. (We may think of s as being equal 
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to -1 but there is a reason for this notation which shall be made clear soon.) 
Define an action of Cy on R? by 


I(vy)=(a,y), sla y)=(yr), (ty) ER’. 


Let G be the set 
G = R? x Co. 


Define the binary operation *:GxG—=G by 


(91, 21) * (92,22) = (91 + 21(92), 21 * 22), 


for all g,, 92 € G and all z,, 22 € Cy. This is a group - the semi-direct product 
of R? with Co. 

To see this, we must answer some questions: 

(a) closed under the operation? Yes 

(b) existence if identity? Yes, e = ((0,0), 1) 

(c) existence of inverse? Yes, ((x,y),1)7' = ((—2, —-y), 1), and ((2z,y),s)7* 
(<1, —0).5). 


(d) associative? This is the hard one: 


((g1, 21) * (Ya, 22) * (Ys, 23) = (9 + 21(92), 21 * 22) * (9s, 23) 
= (91 + ~1(g2) + (41 * 22) (gs), (41 * 22) * 23) 
(91, 21) * (92, 22) * (9s, %3)) = (Gi, 21) * (92 + 22(9s), 22 * 23) 
= (91 + 21(92) + 21 (22(93)), 21 * (42 * 23) 
This implies associativity. 


Exercise 8.8.1 Let G be the semi-direct product constructed in the above ex- 
ample. Show that R? (which we identify with the subgroup {((x,y), 1) | (v7, y) € 
R*} of G) is a normal subgroup. 


Example 177 Let 
53 = {1, $1, $2, $1*82, $251, $1 *52*5S1 }, A, = {1, $2, $1*52*51 }, Aa = {l, si}. 


Let 
&: Hy > Aut( Ay) 
be defined by 
(1) =1 (the identity automorphism) 
6(s1)(h) = sj) *h*s, = 5, *h* 5 


(since sy' = 51), hE Ay. 
Define the (external) semi-direct product of Hy, Hy by G= Hy, >< Hyg. 
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Exercise 8.8.2 Let G be the semi-direct product constructed in the above 
example. Show that H, (which we identify with the subgroup {(h,1) | h € Hy} 


of G) is a normal subgroup. 


There is of course a close relationship between internally defined semi- 
direct products and externally defined ones. The following lemma, which is 
proven in [R], explains this connection: 


Lemma 178 If G is the (internal) semi-direct product of Hy by Hz (so Hy 
is a normal subgroup of G) then there is a homomorphism 


o: Hy > Aut( Ay) 
such that G = Hy ><g(A2). 


Example 179 Let Cy be the cyclic group of order d, which we may regard as 
a setasC, = {0,1,...,d}, with addition performed mod d. Let N = C%, which 
we regard as the group of “n-vectors” with “coefficients” in Cy. Let H = S, 
be the symmetric group on n letters, i.e., the group of all permutations 


p:{1,2,...,n} 2 {1,2,...,n}. 
The group H acts on N by permuting the indices, t.e., the coordinates of the 
vectors. For p € S,,, define p* : Ch + CT by 
P"(V) = (p-A(ays 5 UpA(ny)s Vv = (01, ...,Un) © Ch. 
Now, for p,q € 5, and v,w € C7, define 
(p,v) *(q,w) = (pq,w + q(v)). 


This defines a semi-direct product C? >< S,. 

(It is known that C} ><S,, is isomorphic to the group of all ”C,,-valued 
nxn monomial matrices”, [R], Exercise 7.33. A monomial matrix is a 
matrix which contains exactly one non-zero entry for each row and column.) 


8.9 Wreath products 


Let G, be a group, let Gy be a group acting on a finite set X2. Fix some 
labeling of Xz as say Xp = {hl,h2,...,hm}, where m = |X| and let Gy? 
denote the direct product of G, with itself m times, with the coordinates 
labeled by the elements of X9. 
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Definition 180 The wreath product of G1,G, is the group 


Gi wrG, = GY? >dG, 
where the action of Gz on GY is via its action on Xo. 


In particular, to each t € G wr Gy there is a gg € Gy. We denote this 
projection by gz = pr(t). Define the base of the wreath product by 


B={t€GiwrG, | pr(t) = 1}, 
soB=G?. 
Example 181 Let R" denote the direct product of the additive group of reals 
with itself n times. The group operation on R" is componentwise addition. 


Let S, denote the symmetric group. This acts on R” by permuting coordt- 
nates: ifr ES, is a permutation then define 


(X15 -5 Un) = (Lr(1)s 5 Lr(n))s 


for (a1...,U,) € R". This action respects the addition operation: 


ray + Yt, ey Un t Yn) = (81, +05 rn) Hr (Y1, 5 Yn) 


(@1...,%n), (Y1--, Yn) € R". (Incidently, it also preserves scalar multiplication: 
r(a* (@1,..,2%n)) = a* (21,252), 


for (a1...,%,) € R",a ER, so r defines an invertible linear transformation on 
R"; in fact, there is a homomorphism S,, + Aut(R"), where Aut(R") denotes 
the group of all invertible linear transformations on R"; do you recognize this 
homomorphism? It has occurred previously in the notes...) 
Let G' be the set 
G=R"x S, 


and define a binary operation *: Gx G—G by 
(91, Pi) * (G2, P2) = (g1 + Pi(G2), Pr * P2), 


for all g1,g2 € G and all py, pp € S,. This is a group. 
(a) closed under the operation? 


(b) existence if identity? e = ((0,0),1) 
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(c) existence of inverse? ((x,y),1)>' = ((-2,-y),1), (ay),p) = 
(—p7'(2,y),p7'). 
(d) associative? This is the hard one: 


(91, Pi) * (92, P2)) * (93,3) = (gi + pilge), Pr * P2) * (93, P3) 
= (91 + pi(ge) + (pr * p2)(g3), (Pr * p2) * ps) 
(91, P1) * (92, Pa) * (93, P3)) = (91, Pr) * (G2 + P2(gs), P2 * Ps) 
= (1 + pi(g2) + pi(p2(9s)), Pr * (P2 * Ps)) 


This implies associativity. 
This group ts the wreath product of R with S,: 


G=Rwr5S,, 


where R is the base. 


Lemma 182 (a) The base B, which is isomorphic to the direct product Geel 
is a normal subgroup of G, wr Go, 


(b) (Gi wr G)/B is isomorphic to Go. 
For this, we refer to chapter 8 of [NST] or to [R]. 


Example 183 Let H be the enlarged Rubik’s cube group of all legal and il- 
legal moves of the 3 x 3 Rubik’s cube. In other words, in addition to the 
usual basic moves (namely, R,L,U,D,F,B), we allow you to take apart the 
cube and reassemble the corner and edge subcubes (but we do not allow you to 
remove stickers from the facets). Let C3 denote the group of all rotations of a 
particular corner subcube by a 120 degree angle (actually, this group depends 


on the corner being rotated but since these groups are all isomorphic we drop 
the dependence from the notation). Let Cy denote the group of all flips of a 
particular edge subcube (rotations by a 180 degree angle). (Again, this group 
depends on the edge being flipped but since these groups are all isomorophic 
we drop the dependence from the notation). Then we shall prove later that 


H = (C3 wr Sy) x (Co wr Sz), 


where V is the set of corner subcubes and E is the set of edge subcubes. 
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8.9.1 Application to order of elements in C,, wr S,, 


In some cases, wreath products turn out to be relatively concrete and familiar 
groups. Let S(n,m) denote the group of all n x n monomial matrices with 
entries in C,,. 

We begin with the following result: 


Theorem 184 There is an isomorphism between C,, wr 5S, and the group 
S(n,m) which sends an element (0, f) € Cy wr S,, 0 = (041, V2, ..50n) € Cr 
and f € S,, to the matrix P(f)v. 


This follows from a result (see Theorem 197) which will be proven in the 
next chapter. It is also a special case of an exercise in [R]. In any case, it is 
clear from this that an element (v, f) € C,, wrS,, 0 = (01, V2, ...,Un) € CC 
and f € S,,, is order d only if the permutation matrix P(f) is order d. Indeed, 


We conclude with the following classification of the elements of order d 
in the wreath product. 


Lemma 185 An element (v,f) € CyrwrS,, 8 = (v1, 02,...,0n) € C™ and 
f ES), is order d if and only if f6=1 and&+...+ fo 4(8) =0. 


This result can, in principle, be used in conjunction with with the explicit 
determination of the Rubik’s cube group given later to determine all the 
elements of a given order. As an application, the elements of order 2 of the 
Rubik’s cube group will be given later. 


Chapter 9 


The Rubik’s cube and the word 
problem 


Further details in the following background material may be found in [R], 
[MKS]. 


Definition 186 Given a list L of questions, a decision algorithm for L is a 
uniform set of unambiguous instructions which, when applied to any question 


in L gives the correct answer “yes” or “no” after a finite number of steps. 


9.1 Background on free groups 


Let X = {2x1,...,t,} denote a set and X7! a set disjoint from X whose 


elements we denote by {zj',...,c7'}. Assume that the map x +— «7! defines 


a bijection X — X7~!. It will be convenient to let 2! = 2, x2 = 1, where 1 is 
an element not belonging to X UX7! which we will call the identity element. 


A word on_X is a sequence 
w = (a1, d2,...,4N), 
where N > 0 is some integer and each ai belongs to 
XUX TU Ci}. 


The sequence of all 1’s is called the empty word. The inverse of the word w 
is the word 


wo! = (ay',.-, 47). 
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If a; = y5", where e; is in {0,1,—1} and y; € X, then we shall write the word 
was w= yyl..yy. 


Example 187 Let X = {R,L,U,D,F,B}. The set of words on X are in 
a bijective correspondence with the set of sequences of basic moves you can 
make on the Rubik’s cube. 


We call a word w = yj'...yy% on X reduced if either w is empty or if the 
exponents e; are non-zero and if there are no x € X with x,2~' adjacent in 
w. 


Definition 188 The free group F,, = Fy on the generators 21,...,%7 is the 
group of all reduced words on X. 


The proof that F,, is a group is not entirely easy (verifying the associa- 
tivity property is perhaps the hardest part), see Theorem 11.1 in [R]. 


9.1.1 Length 


If, in the notation above, w = y{1...y\v is a reduced word (so e; € {0,1,—1}) 
then we call N the length (or reduced length, to be more precise) of w. 

If G =< q1,...,9, >C S, 1s a finite permutation group generated by 
permutations 91, ..., g, then we may still define the notion of length: 


Definition 189 Suppose g € G is not the identity, where G is a permutation 
group as above. Then g may be written 


g = YUN 


where each y; © {g1,..-,94} and where e; € {0,1,-1}. The number N and the 
sets {y1,...,yn}, {e1,--,en} may not be unique for a given g but among all 
such possibilities there is at least one such that the value of N is minimum. 
We call this the length of g, denoted (gq). 


Let 
Pe({t)= So, 


geG 


This is called the Poincaré polynomial of G. 
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The length of g is the distance in the Cayley graph between the vertex 
g and the vertex 1. As was mentioned in the chapter on graph theory, the 
problem of determining the largest possible distance in the Cayley graph 
of the Rubik’s cube group is known as ”God’s algorithm” and is currently 
unsolved. 


Example 190 Let G = S,, with generators g; = (t,1+ 1),2 = 1,..,n —1. 
The Poincaré polynomial is known: 
ttt — | 


Il}, ———— 
k=1 t—1 ; 


by [Hum], §3.16. 


In case n = 2, this is 
P4+2P4+2t+1=(t4+1)(?4+t41) 


Problem What is the longest possible length of an element of the Rubik’s 
cube group (with respect to the generators R,L,U,D,F,B)? 


Problem What is the Poincare polyomial of the Rubik’s cube group (with 
respect to R,L,U,D,F,B)? 


9.1.2 ‘Trees 


We may represent the free group graphically as follows. We define the Cayley 
graph of F,, inductively: 


e draw a vertex for each element of X U X7~! (these are the vertices, Vi 
say, for the words of length 1), 


e Suppose we are given that you have already drawn all the vertices for 
the words of length k — 1, V,_, let’s call them. For each x € X UX7! 
and each v € V,_,, draw a vertex for each word of length & obtained 
by multiplying v by z on the right, v * x, and connect v and v * x by 
an edge. 

There are infinitely many vertices, each of which has degree |X U X7']. 
Moreover, this graph has no circuits or loops (i.e., no path of edges crosses 
back over onto itself). Such a graph is called a tree. 
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Example 191 Let X = {R,L,U,D,F,B}. The elements of the free group 
Fy correspond to the mechanically different sequences of basic moves you can 
make on the Rubik’s cube. Of course, different sequences of moves may yield 
the same position of the Rubik’s cube (e.g., R' and 1 are the same position 
but sequence of moves used to attain them are distinct). 

There are infinitely many vertices of the Cayley graph of Fy, each of 
which corresponds to a mechanically distinct move of the Rubik’s cube. 


9.2 The word problem 


There is a way to list all the elements of F;,, called the lexicographic ordering. 
We shall describe a way to list all the words in F;, as though they were in 
a dictionary. We shall give an algorithm for determining if a word w € F,, 
occurs before a word w’, in which case we write w < w’. In the dictionary 
below, we shall, for example, distingish between the the identity 1 and the 
” non-reduced” word x, * xj". 

The first element in this lexicographically ordered list is the word 1, the 
next 2n words are the words 


ay <aypi <u. <a, <a7l. 


In general, we define y..yyr < 21...2n if either (a) M < N or (b) M = N 
and y, < 2 or yy = 4% and yy < zy or yy = 2 and yy = Zz and y3 < z3 Or... . 
List all the elements of F;, as 


F, = {wy, We, wey 


so Wy = 1,wo = %,.... Let G be a subgroup of Ff, or a permutation group 
G =< 91,-.5,9n >. If Gis a permutation group then we regard a word wy, as 
an element of G by substituting g; for each x;,1 <i <n. 


Definition 192 Fiz ag €G. We say that G has a solvable word problem if 
there is a decision algorithm for the list L of questions of the form: Is wp = g 
in G? 


Claim: F;, has a solvable word problem. 
proof The following decision algorithm yields a solvable word problem. 
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1. If w is a word not equal to 1 then underline the first occurance, if any, 
of the expression x; * 2; '. If no such expression occurs in w then go to 
step 3, otherwise go to step 2. 


2. delete the expression x; * 27! from w and go to step 1. 


3. If w = 1 then stop and return ”yes”, otherwise, stop and return ”no”. 


Claim: Each finite permutation group has a solvable word problem. 


Exercise 9.2.1 Why? 


Theorem 193 A decision algorithm for the word problem for the Rubik’s 
cube group with generators R,L,U,D,F,B is the same as an algorithm for 
solving the Rubik’s cube. 


9.3. Generators, relations, and Plutonian robots 


Here’s a hypothetical situation: You and a friend each have a robot on the 
planet Pluto with a scrambled Rubik’s cube. You and your friend also have 
duplicate cubes, scrambled the same way as your robots. (We will call the 
robots R? D? and R? B? if you don’t mind!) These robots have manual dexter- 
ity but no preprogramming on how to solve the cube. Furthermore, assume it 
is very expensive to program different moves, so you want to teach the robot 
the smallest number of separate moves that you can. On the other hand, the 
moves need not be basic moves (U, R, ..) since it we will assume it costs 
roughly the same to teach the robot the move R as the move R*« U? *« R71, 
for example. Your solution will be a ”word” in these taught moves. Again, 
to minimize the cost of transmission, you want the ”word” to be absolutely 
as short as possible. A prize of 1 million dollars has been set up to the first 
of you who can get their robot to solve its cube. 

In other words, we want to solve the word problem for the cube and we 
want to do it as efficiently as possible. Suppose we know we need n generators 
and we know that this is the smallest number. How do we make a ” word” as 
short as possible? To make a word in these generators as small as possible, 
we must know all the “relations” between these generators so we can, if 
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necessary, substitute them into the word and perform some cancelation. This 
is what this section is about. 

Let X be a finite set, say n = |X|. Let Y be a set of reduced words on 
X. Let R be the smallest normal subgroup of F;,, containing Y. Since R is 
normal, the quotient F;,/R is a group. 


Definition 194 Let G be a group. We say that G has generators X and 
relations Y if G is isomorphic to F,,/R. A collection of generators and rela- 
tions defining a group is called a presentation of the group. 


As a matter of notation, an element r € # is written as an equation r = 1 


in G. 


Remark 17 For those with a background in topology: Serre [Ser], 83.3, gives 
a topological interpretation of R as “the fundamental group of the Cayley 
graph of G with respect to X”. 


Example 195 The cyclic group of order 3, C3, has one generator x and one 
relation x° = 1, so 


X={e}, R={(x®)|kEZ} CR =f{e* |k eZ. 


Here the cosets of F/R are R,xR,x?R. The set of these three cosets is closed 
under multiplication. For example, («R)(a?R) = x(Ra?)R = xa?RR = 
z°R = R, so the inverse of the element xR is x7 R. 


More generally, C, has presentation 
C, ={xu | av" = 1}. 


Exercise 9.3.1 By constructing moves of the Rubik’s cube of order 2, 3, 4, 
show that the Rubik’s cube “contains” the subgroups Co, C3, C4. 


Lemma 196 The group C, x C, is presented by 
Cn X Cr, = {a,b | a” = 1,6" = 1,ab = ba}. 


Remark 18 In general, ifG is generated by gy, ..., Gm with relations Ri(g1,---) Gm) = 


1 and if H is generated by hi, ..., hn with relations S;(h1,...,hn) = 1 then 
G x H is the group generated by the g;, hj, with relations R;(g1,...,gm) = 1, 
Si(hi,..,hn) = 1, and g:hj = hjg;. We shall not prove this but refer to 
[MKS], Exercise 18, §4.1. 
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Exercise 9.3.2 By constructing moves of the Rubik’s cube of order 2, 3 
which commute show that the Rubik’s cube “contains” the subgroups Cy x 


Cs = Ce. 


We conclude this section with a table of all the finite groups of order less 
than or equal to 25 and their generators and relations. 


9.4 Generators, relations for groups of order 
< 26 


The following table was obtained from the tables in [TW]. 


Notation: 


C’, = cyclic group of order n, 
D,, = dihedral group of order 2n 
= symmetry group of the regular n-gon, 
Q = quaternion group = {—1,1,—2,2,—j,7, —k, k}, 

S;, = symmetric group of permutations of {1,2,...,n}, 
A,, = alternating group of even permutations of {1,2,...,n}, 
', = finite field with g elements (g=power of a prime), 

Z/nZ = integers modulo n. 
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Order Group G generators relations notes 
2 Cy a a? =] 
3 Cs a ae=1 G = As 
4 C4 a at*=1] 
4 Cy x Co a,b a=1,0?=1, Klein 4-group 
ab = ba Aut(G) = GL(2,F2) 
5 Cs a a= 
6 Cg = Co x Cz a a®=1 
6 S'3 a,b a=1,0=1, Aut(G) =G 
aba = G = GL(2,F2) 
7 Cz a’ =] 
8 Ce a ae = 
8 Cy x C4 a,b a’ =1,b4'=1, 
ab = ba 
8 Cy x Cy x C2 a, b,¢ a=1,b?=1, | Aut(G) = GL(3,F2) 
c? = 1,ab = ba, 
be = cb,ac = ca 
8 D, a,b at’=1,b? =1, 
aba = b 
8 Q a,b a*=1,8? =a’, 
aba = b 
9 Co a a®=1 
9 C3 x C3 a,b a=1,b?=1, | Aut(G) = GL(2,Fs) 
ab = ba 
10 Cho = Co X C5 a al? = | 
10 Ds a,b a=1,b?=1, 
aba = b 
11 Cy at =] 
12 Chg = C3 x C4 a al? =] 
12 Cy x Ce a,b a? =1,b% =1, 
= C, xX Cy x C3 ab = ba 
12 De a,b a® = 1,6? =1, 
aba = b 
12 Ag a,b a=1,0=1, Aut(G) =G 
(ba)? =1 
12 Q¢ a,b a®° = 1,8? =a’, ” dicyclic” 
aba = b 
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13 Cis a a’e=] 
14} Cyg=Cox Cy a ait = | 
14 Dy a,b a’ =1,0=1, Aut(G) =G 
aba = b 
15 | Cys = C3 x Cs a al’ = | 
16 C6 a a6 = | 
16 Cy x Cg a,b a?=1,8=1, 
ab = ba 
16 C4 x C4 a,b at=1,64=1, Aut(G) = GL(2,Z/4Z) 
ab = ba 
16 C3 x Cy a, 6, ¢ a=1,P=1,7=1, 
ab = ba,ac = ca, bc = cb 
16 Cy a,b,c, d a=1,h=1, Aut(G) = GL(4, Fe) 
e=1,d=1, 
ab = ba,ac = ca,ad = da, 
be = cb, bd = db, cd = de 
16 D4 x Co a, 6, ¢ at*=1,P=1,7 =1, 
aba = b,ac = ca,bc = cb 
16 Q x Cy a, 6, ¢ Exercise 
16 a, 6, ¢ a=1,P=1,7=1, 
abc = bca = cab 
16 a,b a?=1,b?=1, 
(ab)? =1,(a-'bP? =1 
16 a,b at=1,64=1, a semidirect product 
aba = b of C4 with C4 
16 a,b a=1,0=1, a semidirect product 
ab = ba? of C’g with Cy 
(C’, normal) 
16 a,b a=1,0=1, a semidirect product 
ab = ba? of C’g with Cy 
(C’, normal) 
16 Ds a,b a=1,0=1, a semidirect product 
aba = b of Cg with Cy 
(Cz normal) 
16 Qs a,b a =1,b% =a", 
aba = b 
17 Cir at’ =] 
18 | Cig = Co x Cy a’® =] 
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18 C3 x Ce a,b ae=1,b°=1, 
= C3 X C3 X Cy ab = ba 
18 53 X C3 a,b,c} a@=1,0P =1,°0=1 
aba = b,ac = ca, bc = cb 
18 Dog a,b a=1,0=1, a semidirect product 
aba = b of Cy with Cy 
(C2 normal), 
Aut(G) =G 
18 a,b,c| @ =1,b =1,7?=1 
ab = ba,aca = c,bcb = 
19 C19 a a’=] 
20 | Co = Cy x Cs a a’? = | 
20 Cy x Cho a,b a? =1,b° = 1, 
ab = ba 
20 Dyo a,b a=], =1, 
aba = b 
20 Qio a, b ae=)|,h =a’, 
aba = b 
20 a,b a=1,64=1, a semidirect product 
ab = ba? of C5 with C4 
(C4 normal), 
Aut(G) =G 
21) Cy =C3x Cr a at = 
21 a, b a’ =1,h=1, a semidirect product 
ab = bat of C7 with C3 
(C’3 normal), 
Aut(G) =G 
22 | Cog = Co X Cry a a’? = | 
22 Diy a,b at=1,0=1, Aut(G) =G 
aba = b 
23 C'3 a=] 
24 | Coq = C3 x Ce a a’* = | 
24 Cy xX Ch a,b a=1,6"=1, 
= Cy x C3 x C4 ab = ba 
24 C3 x C6 a,b,c} a®=1,8 =1,° =1, 


ab = ba,ac = ca, bc = cb 
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24.| Dex Cy | abc} a& =1,0?=1,c =1, 
aba = b,ac = ca, bc = cb 
24] Ayx Co | a,b,c Exercise 
24] Q6 x Co | a,b,c Exercise 
24 | Ds x C3 | a,b,e Exercise 
24| QxCs3 | a,be Exercise 
24] S3xCq | a,b,c Exercise 
24 Dy a, b a?=1,=1, 
aba = b 
24 Qi a, b a? =1,8 =a’°, 
aba = b 
24 Sy a,b at’=1,b0? =1, Aut(G) =G 
(ab)? =1 
24 | SL(2,Fs) | a,b, at=1,8 =a’, Aut(G') = Aut(Q) 
@ = l,aba = 4, = $4, 
ac = cb, bc = cab a semidirect product 
of Q with Cs 
(C's normal) 
24 a, b ae = 1,8 =1, a semidirect product 
aba = 6 of C3 with Cs 
(C's normal) 
24 a,b, ae =1,b4*=1, a semidirect product 
2 =1,beb = c, aba = b, of C3 with D4 
ac = ca (D4 normal) 
25 C5 a a> = 1 
25 C2 a,b Exercise 


Problem Which of these is (isomorphic to) a subgroup of the Rubik’s cube 


group? 


Problem Of those which are subgroups, find moves associated to the gen- 


9.5 The presentation problem 


The following problem is unsolved: 


erators given which satisfy the relations given. (see Fig 9.1) 
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Number of groups of order < 51 


50 
40 
30 
20 
10 

ewSAIWAI YW WA 
a 10 20 30 40 50 


Figure 9.1: 


Problem (Singmaster [Si]): Let G be the Rubik’s cube group. Find a set 
of generators and relations for G of minimal cardinality (i.e., |X|+|Y| 
is of minimal cardinality). 


Problem : Find 
(a) a set of generators for G' of minimal cardinality, 
(b) a set of relations for G of minimal cardinality, 
(c) an expression for each such generator as a word in the basic moves 


R,L,U,D,F,B. 


The part (a) is known: there are 2 elements which generate G [Si]. Part (b) 
is not known (though Dan Hoey’s post of Dec 17, 1995 to the cube-lover’s 
list may describe the best known results [CL]; he suggests that G has a set 
X of 5 generators and a set Y of 44 relations such that the total length of 
all the reduced words in Y is 605). 


9.5.1 A presentation for C? >dS)41 


We begin an assault on the problem of D. Singmaster mentioned above. This 
section was written with Dennis Spellman. 
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We can identify the C®? >< S,4, with the group of (n + 1) x (n + 1) 
invertible monomial matrices g with coefficients in C,,, having the following 
condition on the determinant: if we write g = p-d, where p is a permuta- 
tion matrix and d is a diagonal matrix then det(d) = 1 (this determinant | 
condition is a condition corresponding to the conservation of twists” for the 
moves of the Rubik’s cube). 

We may identify C7 with the subgroup 


{(@1, 0-5 ngs) | U1 UQ.. Uns = 1, rzE Cn} 


and C >< S,41 as a subgroup of the wreath product S,41 wr Cm. 
Consider G = C't! >< S,4,. The group $,41 has presentation 


Sra =< G15 +065 An | (a;a;)"™ = 1, Vi< ,7 <n>, 


where 
3, jg=ttl, 
mi = 4 2, fe—gl >, 
1, i= j. 


The following diagram may help to visualize the exponents m;; in the 
case n = 4: (see Fig. 9.2) 

As a group of (n + 1) x (n + 1) monomial matrices, we identify a; with 
the permutation matrix, 


1 O ve 0 


eo) 
or 


0 .. 0 1 


If I is the (n + 1) x (n + 1) identity matrix and if E;; denotes the matrix 
which is 0 in every entry except the 77 entry, which is 1, then 


8 =1-— by — Bisigg + Bigs + Bins 
The group C,,, has presentation 
Cm =<h|hA™=1>. 
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a 
3 3 
1 2 1 
‘ae + ae) 
_ 2 “ 
3 ra 
a 
{ }1 
Figure 9.2: 


The group C® has presentation 
GC) =a higeady| be Sly ele = one VI 2g Sel SS 
We identify C” with the Cartesian product 
{(hi(a1), h(t), +05 Mn(n)) | ti € Cm}, 
where h;(t) is the diagonal matrix 


At) = 1 — Ey -— Boag tthe tt Bea. 


There are the following identities between the s; and the h,(t): 


This in mind motivates the formulation of the following statement (which 
we shall prove in the next section): 
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Theorem 197 


CP >I Sn H< Ay, 00, ny hy, Rn | (a;a;)™ = 1, 
Vi <ijg <n, 
hin = 1, hih; = hjhi, V1 < 67 < n 
ajhja;' = hj, \i-j| > 1, 
a:hyay' = hz, 


dr hjayy, = hihiar > 


Remark 19 The above result was proven before it was noticed that essen- 
tially the same presentation may be found in the paper [DM] by Davies and 
Morris (where the group C™ >< S141 ts called a generalized symmetric group). 


9.5.2 Proof 


Let P denote the group presented in the above theorem. The claim is, of 
course, that C7 ><d5,4, = P. There is a surjective homomorphism f : 
P—C" ><1S,41 given by sending the generators to the generators. The 
problem is to which that this is injective. Let K = ker(f), so 


|P| = |P/K||K| > |P/K| = |Ch >dShyi] = m"(n + Ih 


Note H =< hy,...,hy | A” = 1, hih; = hjh; > < P is a normal subgroup of 
P since each a; sends a generator of H to a product of them or their inverses. 
Also, note H = Cy. 

We claim that P/H = S,,4,. From this it will follow that |P| = m"(n+1)!, 
proving that |A’| = 1, as desired. 

To establish P/H ~ S41, we show that the presentation on P/H one 
gets from Theorem 2.1 in [MKS] is the same as that of S,41. This is actually 
easy to see: If W(a1,...,@n, hi, ..., hn) = 1 is a relation in the presentation, we 
must determine the word W(ay,...,dn,1,...,h,)H in P/H. Note that every 
relation ”collapses” and becomes trivial except for the relations (a;a;)"" = 1. 


These relations define the presentation for 5,41, as desired. 
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Chapter 10 


The 3 x 3 Rubik’s cube group 


In this chapter, we describe mathematically the moves of the 3 x 3 Rubik’s 
cube. 


10.1 Mathematical description of the 3 x 3 
cube moves 


In this section, we describe mathematically the moves of the 3 x 3 Rubik’s 
cube. As we will see, this will lead eventually to the description of the 
Rubik’s cube group as a subgroup of index 12 of a direct product of two 
wreath products. 


10.1.1 Notation 


First, orient all the corners and edges as in theorem 60. These are depicted 
as follows, except that we have replaced the ”+” in theorem 60 with a white 
square: (see Fig 10.1, Fig. 10.2 and Fig 10.3, ) 

and (see Fig 10.4 and Fig 10.5) 

Let G =< R,L,U,D,F,B > be the group of the 3x3 Rubik’s cube and let 
H be the ’enlarged” group generated by R,L,U,D, F, B and all the illegal” 
moves (where one is allowed to disassemble and reassemble the cube but not 
remove any facets). Let V denote the set of vertices of the cube (which we 
identity with the set of corner subcubes of the Rubik’s cube) and let 


p:H— Sy 
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Figure 10.1: 


Figure 10.2: 
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Figure 10.3: 


Figure 10.4: 
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Figure 10.5: 


denote the homomorphism which associates to each move of the Rubik’s cube 
the cooresponding permutation of the vertices. Let F’ denote the set of edges 
of the cube (which we identity with the set of edge subcubes of the Rubik’s 
cube) and let 


a:H—O Sr 


denote the homomorphism which associates to each move of the Rubik’s cube 
the corresponding permutation of the edges. 


10.1.2 Corner orientations 


Let v : H — C8 be the function which associates to each move g € H the 
corresponding corner orientations. More precisely, let g € H and say g moves 
corner z to corner j. Then v;(g) € C3 is the orientation which the 7” vertex 
gets sent to by g, where the vertices are labeled as in the diagram shown and 
where the orientation is the number of 120° clockwise twists required to turn 
the relative reference ”+” obtained by moving corner 2 to 7 using the move 
g into the standard reference ”+” on corner j. 


Example 198 We have 
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Xx aX) 

F | (2,0,0,,1,0,0,2) 
U | (0,0,0,0,0,0,0,0) 
F*U | (2,0,0,1,1,0,0,2) 
U*F | (2,0,0,1,1,0,0,2) 
D | (0,0,0,0,0,0,0,0) 
B | (0,1,2,0,0,2,1,0) 
R | (1,2,0,0,2,1,0,0) 
L | (0,0,1,2,0,0,2,1) 


Remark 20 The effect of a move g € H on the corner orientations may 
also be regarded as a relabeling of the ”+” markings. 


Note that a move g € H has two effects on the corners: 

(a) a permutation p(g) € Sy of the vertices, 

(b) a reorientation of the vertices moves in (a). 
In particular, for g,h € H, the orientation v(gh) can only differ from v(g) in 
the coordinates corresponding to the vertices permuted by h. 

We shall now verify that the ”relative” orientation v(gh) — v(g) is the 
same as the orientation v(h), provided one takes into account the effect of g 
on the vertices: v(h) = p(g)(v(gh) — v(g)), ie. 


Lemma 199 o(gh) = v(g) + p(g)*(wh))- 


proof: The move gh orients the 7" corner subcube by v;(gh) and per- 
mutes the vertices by p(gh), by definition. 

On the other hand, gf will first act by g then h. The move g will reorient 
the i” corner subcube by v;(g) and send the 2” vertex to the p(g)(z)™ vertex. 

To study the subsequent effect of A on this, let us subtract v(g) from 
v(gh), so that we are back to our original orientation (we will add v(g) back 
in later). Call this position the modified cube for now. 

The move h first orients the 7" corner subcube of the modified cube by 
v;(h) and permutes it to vertex p(h)(j). The 2” subcube of the modified 
cube comes from (via g) the p(g)~1(z) subcube of the original cube. Thus 
the i" corner subcube of the modified cube is, by means of h, reoriented by 
Vp(g)-1( (2). To this we must add in v,(g) to get the total effect of gh on the 


i” vertex of the original: 


vi(gh) = v9) + Pa lh)), 
for each 1 <2 <8, which implies Lemma 199. 
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10.1.3. Edge orientations 


Let w: H — C}* be the function which associates to each move g € H the 
corresponding corner orientations. More precisely, let g € H and say g moves 
corner 7 to corner j. Then w;(g) € C2 is the orientation which the 7” vertex 
gets sent to by g, where the vertices are labeled as in the diagram shown and 
where the orientation is the number of 180° flips required to turn the relative 
reference ”+” obtained by moving corner 2 to 7 using the move g into the 
standard reference ”+” on corner j. 
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w(X) 
(1,0,0,0,0,0,0,0, 1,0,0,0) 
(1,0,1,0,0,0,0,0,0,0,0,0) 
F*U | (1,0,1,0,1,0,0,0,1,0,0,0) 
U*F | (1,1,1,0,0,0,0,0,1,0,0,0) 
(0,0,0,0,0,0, 1,1,0,0,0,0) 
(0,0,0,0,0,0,0,0,0,1,0, 1) 
(0,1,0,0,0,1,1,0,0,1,0,0) 
(0,0,0,0,1,0,0,0, 1,0,0,0) 


3 


% 


Haus 


Remark 21 The effect of a move g € H on the edge orientations may also 
be regarded as a relabeling of the ”+” markings. 


Note that a move g € H has two effects on the edges: 

(a) a permutation o(g) € Sz of the edges, 

(b) a reorientation of the edges which were moved in (a). 
In particular, for g,h € H, the orientation w(gh) can only differ from w(g) 
in the coordinates corresponding to the edges permuted by h. 

We shall now claim that 


B(gh) = B(g) + o(g)*(@(h)), (10.1) 


ie., that 

wi(gh) = wi(g) + Wo (h), 
for each 1 <2 < 12. The proof of this is so similar to the proof of Lemma 
199 that we leave it to the student to modify its proof to verify (10.1). 
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10.1.4 The semi-direct product 


Consider the following direct product of two semi-direct products: 
H'= (CS >< Sy) x (C3? >< Sr). 


Remark 22 This may also we written in the notation of wreath products as 
the following direct product of two wreath products: 


H' = (Sy wr C8) x (Sp wr C3”). 


As a set, we think of H as belonging to C3 x Sy x C$ x Sy. If we represent 
elements h,h! of H ash =(v,r,w,s),h’ = (v',r', wu’, 3’) © C3 x Sy x C3 x Sy 
then the group operation will be given by 


hxh' = (v,r,w,s) *(v',r’, ww’, s’) = (vu + P(r)(v’), rr’, w + P(s)(w’), 8s’). 
Consider the function 


t:H 4 (CB >aSy) x (Cy? >< Sp) 
g > — (v(g),p(g), (9), (9). 


Proposition 201 ¢ is an isomorphism, H = H'. 


proof: Since 


the map z is a homomorphism. Since any reorientation and permutation can 
be achieved by some illegal move, ¢ must be surjective. By theorem 60, the 
kernel of ¢ is trivial (this is just a fancy way of saying that if no subcube is 
permuted or reoriented then the cube doesn’t change!). 


10.2 Second fundamental theorem of cube the- 
ory 


First, some preliminaries. We identify, as in 810.1, each g € G with a 4-tuple 


(X(9), (9g), (9), o(9)), 


where 
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e p(g) is the corresponding permutation of the set of vertices V of the 
cube, 


e o(g) is the corresponding permutation of the set of edges FE of the cube, 
e v(g), w(g) are “orientations” defined in 810.1. 


Remark 23 Let S,, denote the symmetric group on n letters and identify Sy 
with Sg, Sp with Sj,. By example 138, we know that 

(a) p: G— Ss is a homomorphism, 

(b) 0: G— S4q is a homomorphism. 


Question: Given a 4-tuple (v,r,w,s), where r,s are permutations of the 
corners, resp. edges, as above and 


vec, wecy’, 


what conditions on r,s, v, w insure that it corresponds to a possible position 
of the Rubik’s cube? 
The following result is, according to [BCG], due to Ann Scott. 


Theorem 202 (Second fundamental theorem of cube theory) A 4-tuple (v,r, w, s) 
as above (r € Sg, 8 € Sia, v € C$, w € Cy") corresponds to a possible position 
of the Rubik’s cube if and only if 

(a) sgn(r) = sgn(s), (“equal parity as permutations”) 

(b) uy +... + vg = 0 (mod 3), (“conservation of total twists”) 

(c) wy +... + wig =0 (mod 2), ("conservation of total flips”). 


proof: First we prove the ”only if? part. That is, we assume that 
(v,r,w,s) € Sy x Sp x C8 x Cy represents a (legally obtained!) position of 
the Rubik’s cube. From this we want to prove (a)-(c). 

Let g € G be the element which moves the Rubik’s cube from the solved 
position to the position associated to this 4-tuple. Then r = p(g) and 
s = a(g). We know that g may be written as a word in the basic moves 
R,L,U,D,F,B, say g = X1...X,, where each X; is equal to one of the 
R,L,U,D,F,B. Observe that if X is any one of these basic moves then 
sgn(p(X)) = sgn(o(X)). Since sgn, p, and o are homomorphisms, it follows 
that 


k k 


sgn(r) = sgn(o(g)) = [] sgn(o(Xi)) = T] san(o( Xi) = sgn(o(X)) = sgn(s), 
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This proves (a). 

We have verified (b) for the basic moves in example 198 above. Note that 

(i) the conservation of twists condition in (b) is true for (v1,..., vs) if and 
only if it is true for any permutation P(p)(v) = (va)p,-+ %(8)p) 

(ii) if (v1,.., vg) and (v4, ..., vg) each satisfy the conservation of twists con- 
dition in (b) then their sum also satisfies it. 

As above, write g as a word in the basic moves R,L,U,D,F,B, say 
g = X,...X;%, where each X; is equal to one of the R,L,U,D,F,B. We 
assume that this expression is minimal in the sense that we choose the X; so 
that k is as small as possible. This & is called the length of g. (This length 
is the same as the distance from g to the identity in the Cayley graph of G.) 

We now prove (b) by induction on the length. We have already checked 
it for all words of length k = 1. 

Assume k > 1. By the formula giving the orientation of the product of 
two moves in terms of the two orientations of the moves, we have 


BUX... Xp 1X) = p(X,...Xp-1)7 | (8(X;,)) + vUX,... X41). 


The term p(X1...X4-1)7'(0(X;)) satisfies the conservation of twists condi- 
tion in (b) by (i) above. The term v(X4...X;,) satisfies the conservation of 
twists condition in (b) by the induction hypothesis. Their sum satisfies the 
conservation of twists condition in (b) by (ii) above. This proves (b). 

The proof of (c) is very similar to the proof of (b), except that we use 
example 200 in place of example 198. 


Exercise 10.2.1 Provide the details. 


Now, we must prove the theorem in the ”if” direction. In other words, 
assuming (a), (b), and (c) we must show that there is a corresponding legal 
position of the Rubik’s cube. This part of the proof is constructive. 

First, we prove a special case. Assume that r and s are both the identity 
and that (w1,...,wiz2) = (0,...,0). 

There is a move which twists exactly two corners and preserves the 
orientations and positions of all other subcubes. For example, the move 
g = (R'D?RB'U’B) twists the ufr corner by 120° clockwise, the bdl 
corner by 240° clockwise, and preserves the orientations and positions of all 
other subcubes. This move can be easily modified, by a suitable conjugation, 
to obtain a move which twists any pair of corners, and preserves the orienta- 
tions and positions of all other subcubes. These moves generate all possible 
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8-tuples satisfying the conservation of twists condition in (b). This proves 
the “if” part of the theorem in the case that r and s are both the identity 
and that (wy, ...,wi2) = (0,...,0). 

Next, we prove another special case. Assume that r and s are both the 
identity and that (v1, ...,vg) = (0,...,0). 

There is a move which flips exactly two edges and preserves the orien- 
tations and positions of all other subcubes. For example, the move g = 
LFRF-'L'U?RURU' R?U? R (found in [B], page 112) flips the uf edge, 
the ur edge, and preserves the orientations and positions of all other sub- 
cubes. This move can be easily modified, by a suitable conjugation, to ob- 
tain a move which flips any pair of edges, and preserves the orientations and 
positions of all other subcubes. These moves generate all possible 12-tuples 
satisfying the conservation of flips condition in (c). This proves the ”if” 
part of the theorem in the case that r and s are both the identity and that 
(v1,...,08) = (0,...,0). 

As a consequence of these last two special cases, it follows that the ”if” 
part of the theorem is true in the case that r and s are both the identity. 

Finally, we prove our last special case. Assume that (v,..., vg) = (0,...,0) 
and that (wy, ...,Wi2) = (0,...,0). Consider the following three claims. 


e Given any three edges subcubes, there is a move which is a 3-cycle on 
these edges and preserves the orientations and positions of all other 
subcubes. 


e Given any three corners, there is a move which is a 3-cycle on these 
corners and preserves the orientations and positions of all other sub- 
cubes. 


e Given any pair of edges and any pair of corners, there is a move which 
is a 2-cycle on these edges, a 2-cycle on these corners, and preserves 
the orientations and positions of all other subcubes. 


Exercise 10.2.2 Verify these three claims. 


By proposition 159, we know that Ag is generated by the edge 3-cycles 
above and that Ay is generated by the corner 3-cycles above. In other words, 
we can construct a position of the Rubik’s cube associated to any 4-tuple 
(r,s,0,0), provided r € Ay and s € Ag. The subgroup Ag x Ay is index 
4 in Sg x Sy since |S,/A,| = 2. The third type of move, the edge-corner 
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2-cycles above, does not correspond to an element of the subset Ag x Ay of 
the Rubik’s cube group because an edge 2-cycle is an odd permutation of the 
edges. Therefore, if we consider the subgroup of Sz x Sy generated by all 
three types of moves we will obtain either all of Sg x Sy or some subgroup 
of index 2 which properly contains Ag x Ay. The first possibility can be 
ruled out since it contradicts the parity condition in (a). The only subgroup 
of Se x Sy of index 2 which properly contains Ag x Ay is the subgroup of 
elements satisfying the parity condition in (a). 

It follows that the if” part of the theorem is true in the case that v and 
w are both zero. 

The theorem is a consequence of these special cases because of the fol- 
lowing 

Claim: There is always a move, no matter what position of the Rubik’s 
cube is in, which does not permute any subcubes but ”solves” the orientation 
of the cube so that v and w are both zero. 


Exercise 10.2.3 Prove this claim. 


Corollary 203 G = {g = (v,r,w,s) € H | (a), (8), (c) in the above theorem hold}. 


10.2.1 Some consequences 


We shall now reformulate the above fact about the Rubik’s cube group from 
a point of view which (to me anyway) allows us to count the number of 
elements it has easier. Let 


Go = {(v, 7, w, 8) | re Sg, SE S49, 
V = (UY, v2, ..,08),¥; € {0,1,2},0, +... + vg = 0 (mod 3), 
w = (Wy, We, ...,Wi2), W; € {0,1}, wi +... + wig = 0 (mod 2)}. 
Define a binary operation * : Gp X Go — Go by 
(v,r,w,s)* (vr, w’,s’) = (vt P(r)(v'),r *r’,w + P(s)(w’), 5 * 8’). 


This defines a group structure on Go. This is a subgroup of the enlarged 
Rubik’s cube group of index 6. 
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Theorem 204 There is an isomorphism 
Go = (CS >< Ss) x (Cy! >< S12), 


where C,, is the cyclic group with n elements and >< denotes the semi-direct 
product and where Ck (n = 2,3, k =7,11) is identified with the subgroup of 
C+! defined by 


{v = (U4, v2,-.,0%) |v € {0,l,n —1},0, +... +0, = 0 (mod n)}. 
In particular, 
Go| = |Ss|[Si2||CZ"||Cs| = 8!-12!- 21. 37, 
Theorem 205 The Rubtk’s cube group G is the kernel of the homomorphism 
6:G> {L,-1} 


(v,r,w,s) > sgn(r)sgn(s). 
In particular, G < Go is normal of index 2 and 
|G| = 8!- 121. 210. 37, 


Recall that the commutator subgroup G, of G is the subgroup consisting 
of all finite products of commutators 


[yh] =ge hag eh”, 
where g, h are arbitary elements of G. 
Theorem 206 |G,| = |G|/2. 
In fact, we can explicitly determine G',. 
Theorem 207 G, = {9g € G | sgn(p(g)) = sgn(o(g)) = 1}. 


This basically follows from the fact that the commutator subgroup of 5S, 
is A,,, for n > 4 (see chapter 3 of [R]; in fact, for n > 4, A, is the only proper 
non-trivial normal subgroup of S,,). 

The above theorem implies that |G/G| = 2 (to see this, use the first 
homomorphism theorem). From this, it clearly follows (from those who see 
it clearly) that G, is a normal subgroup of G. 
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10.3. The homology group of the square 1 puz- 
zle 


This section is based on a paper written jointly with J. McShea [JM]. 


Here we study the group theoretic properties of the collection G' of all 
*words” in the basic moves of the square 1 puzzle which preserve the cube 
shape. This collection G forms a group which, motivated by [W], we call 
the homology group of the square | puzzle. The list of shapes which the 
square | puzzle can make is given in [Sn2]. It is not hard to see that the 
homology group of any one of these other shapes is conjugate to G’, so from 
a group-theoretic standpoint, we may focus our attention on the cube. We 
shall also make use of the moves given in [Are] which belong to G. (see Fig. 


10.6) 


Figure 10.6: 


A bi-product of the proof is an collection of moves which can be used to 
solve the square | puzzle, once it is put in the cube shape. 
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10.3.1 The main result 


Let 5, denote the symmetric group of degree n, i.e., the group of permu- 


tations of {1,2,...,n}. Let sgn : S$, — {+1} denote the homomorphism 
which assigns to each permutation its sign (the sign of a cyclic permutation 
of length r is (—1)’*', for example). 

We shall see that the size of the homology group of the square | is about 
.8 billion. 


Theorem 208 G is isomorphic to the kernel of index 2 in Sg x Sg of the 
homomorphism f : Sg x Sg + {+1} defined by f(g1, 92) = sgn(gi)sgn(ge). 
Consequently, |G| = 2'°345?7? = 812851200. 


As a corollary of the proof of this theorem, given below, we shall see that 
any even permutation of the corners is possible and any even permutation of 
the wedges is possible. 

Let H denote the enlarged square | group generated by all legal moves 
preserving the cube shape and all illegal moves (i.e., disassembly and re- 
assembly is allowed) preserving the cube shape. It is clear that 


H = Sg x Sg. 


Some notation 


We shall assume that the puzzle is in the solved position with the ”square 
1” side in front, right-side up. Let 


e wu denote rotation of the up face by 30° clockwise, 
e d denote rotation of the up face by 30° clockwise, 


e F denote rotation of the cube by 180° though one of the skew-diagonal 
cuts (in a given position, at most one such move is possible, so this is 
unambiguous). 


Like the 15 puzzle, and unlike the Rubik’s cube, not any sequence of u, d, 
and R’s is possible. 
Let 


T(z,y)=u*xReexy*Reu', Bley) =d'*Reaxy* Red, 
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up face of square 1 


V/ 
A 


front 


Figure 10.7: 


down face of square 1 


front 


ag 


V/ 


Figure 10.8: 
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where x,y are moves of the square | puzzle. (see Fig 10.7 and Fig. 10.8) 
In the notation of these diagrams, we have 


uRu—'d7' Rd = (2,8)(4,6) 


T (uw, 1) = (W,6/,77,4")(1,6,7,4) 
T(1,d°) = (2',3',8,5\(2,3,8,5) 
Bi, 1) = (0,2, 7,8)(1.6,77,4) 
B(1,d") = 


(3,4, 5,6)(2", 3’, 8/5’). 


Two subgroups 


Let 
G, =< T(u’,1), Td’) > 


and 


Ga =< B(u*,1), BU, d’) > 
Lemma 209 G, and Gy are each isomorphic to C4 x C4. 


proof: We have T(u?,1)7(1,d°) = T(1,d*)T(u?,1). Moreover, T(u*,1) 
and T(1,d*) are each of order 4. Since 


Cy x Cy=<a,b| at =1,b'=1,ab= bas, 


the lemma follows. 


The homology group of the square 1 puzzle is defined to be 
G =< du’, Blu’, 1), BU, @), Plu’, I, PL) > 


We shall use the following labelings to describe the moves of the square | 
puzzle 


10.3.2 Proof of the theorem 


We shall prove the theorem in the following steps: 


e Show that the wedge 3-cycle (1,2,3) and the corner 3-cycle (1, 2’, 3’) 
each belong to G. 


e Show that any wedge 3-cycle (1, 2,7) and each corner 3-cycle (1', 2’,2’) 
belong to G. 
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e Show that there is a injective homomorphism ¢: G— Sg x Sg where 
the image ¢(G) contains Ag x Ag. 


e Conclude that G = Sg x Sg/{+1}. 


Step 1: First, we claim that (1,2,3) belongs to G. In fact, the 3-cycle 
(1,2,3) is obtained from the move 


M, = (B(u?, 1)*d?)*((B(u?, L)ed7?)*(B(u7®, 1x01, d7?)*d®)))4(B(u®, L)*d?)7t. 


(Incidently, this 80 move long manuever may be verified using GAP [Gap]. 
See also [Sn2].) 
Next, we claim that (1’, 2’,3’) belongs to G. In fact, 


Mz = Ru? Rd~° Ru?(Ru7*)?d Ru? 


is the product of 2-cycles (2’, 3’)(3,4). (This move was found in [Sn2].) There- 
fore, u?Mzu7° is the product of 2-cycles (1',2')(2,3). The product of these 
is (1, 2',3')(2,3,4). Since (2,3,4) is obtained from u~?M,u", we see that 
(1’, 2’, 3") is in G. (This may also be verified using GAP.) 

Step 2: Let g be any move in G' which sends wedges 3 to wedge 2, resp., 
and does not move wedges 1,2 (it may permute other wedges and corners). 
Then (1,2,7) = g * (1,2,3) *g7*. Thus (1, 2,7) €G. 

The proof that each (1’, 2,7’) € G is similar. 

Step 3: It is clear from our definition that there is an injection G > Sgx Sx: 
as sets. The verification that this is a homomorphism is straightforward. 

Step 4: The group Ag is generated by the 3-cycles (1,2,7) (see Lemma 
160). Since these all belong to G, all even wedge permutations are possible. 
Similarly, all even corner permutations are possible. Thus Ag x Ag C G. 

Let py : Sg x Sg — Sg denote the projection onto the first factor. 
Let p, denote the projection onto the second factor. For each generator 
g € {d?,u’, B(u®, 1), BO, da’), T(v’, 1), TU, d?)} of G we have sgn(pi(g)) = 
sgn(po(g)). Thus the image ¢(G) is strictly contained in Sg x Ss. In fact, 
this shows that ¢(G) is contained in the kernel ker(f) of the homomorphism 
f : Sg x Sg > {+1} defined in the statement of the theorem. Since 


Ag X Ag C GC ker(f), 


[ker(f) : Ag x Ag] = 2, and T(u*,1) € Ag x Ag, the theorem follows. 
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Chapter 11 


Other Rubik-like puzzle groups 


” An expert is someone who knows some of the worst mistakes 
that can be made in his subject, and how to avoid them.” 
Heisenberg, Werner 


PHYSICS AND BEYOND, 1971 


This chapter shall survey, sometimes without proofs, some results on the 
group-theoretical structure of some of the permutation puzzle groups, as 


discussed in [GT], [Lu], [B], chapter 2, [NST], chapter 19. 


11.1 On the group structure of the skewb 


This section is based on G. Gomes and J. Montague [GM]. 

Notation: We fix an orientation of the cube and label the sides by 
R,L,U, D, F, B as in the case of the Rubik’s cube. The 120 degree clockwise 
rotation of a corner is denoted by a 3-letter juxtaposition of the letters ab- 
breviating the 3 faces which the corner meets. (When you twist a corner of 
the skewb you must permute three other corners but the opposite side of the 
skewb is unaffected.) Such a move will be called a basic move - there are 8 
of them, though twisting about a corner and twisting about the antipodal 
opposite corner is basically the same move (up to a rotation of the entire 
cube.) For example, FRU denotes the 120 degree clockwise rotation of the 
front-right-up corner, leaving the rest of the cube alone. 

Let C denote the set of square center facets and V the set of vertices of 
the cube. 


209 
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Let 
G=< FRU, FLU, BRU, BLU, DF R,DFL,BDR,BDL > 


denote the group of all (legal) skewb moves. Let G* denote the group of all 
legal and “illegal moves” (where disassembly then reassembly is allowed). 
On each square center facet of the skewb we may choose a vertex with the 
following property: if we draw an arrow pointing from the chosen vertex of 
the square to the diametically opposite vertex on the square then the moves 
of the skewb permute these arrows amongst themselves, except that some 
arrows may possibly be reversed. This determines an orientation of each 
center facet. Call this puzzle the super skewb. For this new puzzle, let 


Geuper =< FRU, FLU, BRU, BLU, DFR, DFL, BDR, BDL > 


denote the group of all (legal) super skewb moves. Let G%,,.,. denote the 
group of all legal and “illegal moves” (where disassembly is allowed). 

We orient the corners as in the case for the Rubik’s cube. Let y(g) € 
C3 = {0,1,2}* denote the orientation for the corners. 

For the superskewb, we orient the center facets similarly. Let z(g) € Cf = 
{0,1,2,3}° denote the orientation for the centers. 

Let Sc denote the symmetric group on the set C’, Sy the symmetric group 
on the set V. 


Claim: There are homomorphisms 
p:GoSo, a@:G—Sy, 
given, for each move g € G, by 
p(g) = permutation of the center facets associated to g, 


and 
o(g) = permutation of the vertices associated to g. 


Let 
H=C8 x Sex Sy 


and define *: H x H — H by 


(y.r.s) *(y'yr',8') = (rr, ssl (y!) +y). 
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Let 
super = C$ x Sy x Ce x So 


and define *: Hguper X Hsuper 2 Hsuper by 
(yrzs)*e(yir, 27,3’) = (r7"(y') t+y,rr’, s7!(z") + 2,88’). 


Observation: There is an embedding of G into H and an embedding of 
Guyer into Hyuper: 

Let Go be the group that acts only on the center facets of the skewb, and 
Gy the group that acts only on the vertices. Now, 


G=Go x Gy. 


Every generator of G is a 3-cycle on the center facets. This means that r is 
an element of Ac. It is a fact that the elements (7,j,k) of 5, generate A, 
(for 2,7, & elements of {1,2,...,n}). Therefore, Go = Ag. 

The group that acts on the vertices of the skewb is slightly more com- 
plicated. Unlike the Rubik’s cube, there is no condition like conservation 
of twists which applies to the entire vertex set. Instead, we must split the 
vertices of the skewb into two 4-corner orbits. This idea is borrowed from 
Bandelow’s booklet on Mickey’s Challenge (a puzzle similar to the skewb). 
An orbit is constructed by starting with one corner and including the op- 
posite corner of each face that meets at the first corner. Referring back to 
our original labeling of the skewb, the orbits are the odd corners {1,3,5, 7}, 
and the even corners {2,4,6,8}. Let the orbit of odd corners be denoted by 
V (odd), and let V(even) denote the orbit of even corners. We now partition 
Gy so that 

Gy = GY (odd) x GV (even): 
We know that each orbit maps to a permutation on 4 vertices and an orien- 
tation on 4 vertices. So 


Gy is a subgroup of (Cf ><4.$4) x (Cf ><1S4). 
Let h = (s,u(h),t, v(h)) be an element of Gy. 


First we will examine the permutations of the vertices of both orbits. 
Each generator produces a 3-cycle on the vertices, whether they are in the 
odd or even orbit. Therefore, we can say that the permutations of each orbit 
generate A, by the same argument used for the center permutations. Now, 


Gy is a subgroup of (C3 >< Ay) x (CP >< Aq). 
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Claim 1: There exist h, such that s is an element of Aq, u(h) = (0,0,0,0), 
t is an element of Ay, and v(h) = (0,0,0,0). We know this is true because 
there are clean skewb moves which only permute 3 vertices. 

Claim 2: Given any permutation u’ of (1,2,0,0) by an element of Ay 
and any permutation v’ of (1,2,0,0) by an element of Aj, there exist h, such 
that s = 1, u(h) = wu’, t = 1, and v(h) = v’. This is true because there are 
clean skewb moves which only twist vertices. 

If we combine the moves of Claims 1 and 2, we should generate all of 
the possible moves of Gy. The condition on each of the vertex 4-tuples will 
drop them in dimension to elements of C3. So we can conclude that Gy is a 
subgroup of index 9 of 


(C3 ><1 Aq) x (C3 >< Ay). 


Note: This claim is verified by GAP. 
Since Go = Ag, and Gy = (C3 >< Aq) x (C3 >< Ag), we can conclude 
that 
G = Ag x (C3 >< Ag) x (C$ >< Ay). 


and 


|G] = (6!/2) * (41/2)? * (3°) = 37, 791, 360. 


In conclusion, it is interesting to note that if we let G’ denote the illegal 
skewb group - where reassembly is permitted - then 


G' = S6 x Sg x C8. 


and 


IG|/|G"| = .0001984127... 


This means that if you could take apart the skewb and reassemble it however 
you wanted (leaving the stickers intact however), then only about .02 percent 
of all possible reassemblies would be solvable. The analogous percentage for 
the Rubik’s cube is 8.33.. percent. As M. Schonert points out in a post to 
[CL], this makes the skewb harder to solve that the Rubik’s cube in some 
sense . 
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Permutation and Orientation Tables 


Move | Center Permutation | Vertex Permutation 
UFR (1 5 2) (2 6 4) 
UFL (1 45) (1 7 3) 
DFR (1 2 6) (1 57) 
DFL (1 6 4) (4 6 8) 
BRU (2 5 3) (1 35) 
BLU (3.5 4) (2 48) 
DBR (2 3 6) (2 8 6) 
DBL (3 4 6) (3 75) 
Move Vertex Orientation 
UFR (12020200) 
UFL (20210020) 
DFR (20002120) 
DFL (0002021 2) 
BRU (21202000) 
BLU (02120002) 
DBR (02001202) 
DBL (00202021) 
UFR*UFL | (02220020) 
DFR*DFL | (200200 2 2) 


Note: The orientations for the generator moves contain two repeated 
orbits - permutations of (1 0 0 0) and permutations of (2 2 2 0). 


11.2 Mathematical description of the 2 x 2 
cube moves 


This section, which is based on [DL], derives the group structure of the 2 x 2 
Rubik’s cube. 

A position on the 2 x 2 cube is determined by 

(a) a permutation of the vertices, and 

(b) the orientation of the corner sub-cubes. 

An illegal move on the 2 x 2 cube is a reassembly of the corners. 

Let 

H=< R,L,U,D,F,B, and all the illegal moves > . 
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This will be called the enlarged 2 x 2 cube group. Let G =< R,L,U,D,F,B>. 
G is contained in H with G < dH. 

Let C$? = {0,1,2}* be the group of 8-tuples with coordinate-wise addition 
mod 3. Let v: H — C$ be defined as follows: Assume h € H sends the 7” 
corner to the j corner. v;(h) is the number in C3 = {0,1,2} which describes 
the orientation that the standard reference marking of the 2’” corner is sent 
to relative to the standard reference marking of the j corner. The values 
of v are tabulated in (198). 

Let Sy be the group of permutation of corner sub-cubes. We may identify 
Sy with Sg since we have labeled the corners 1,...,8. H is a subset of the 
Cartesian product Sy x C8. 

Let p(h) denote the permutation of the vertices of the cube associated to 


h © H. We have 


It is not hard to show, based on the results of the previous section, that 
H = C8 ><8s = {(v,r) |r € Sy, v € C$}. In other words, H is the wreath 
product of Ss and C3. 


Theorem 210 A two-tuple (v,r) € C3 x Sy corresponds to a legal position 
iff ur +... +08 = 0 (mod 3) (conservation of twists). 


proof: PART 1: In this part, we show that any pair (v,r) as in the 
theorem (where v satisfies conservation of twists) corresponds to a legal move 
g in such a way that r = p(g) and v = W(g). 

Case 1: Assume r = 1 and v is arbitrary. From the solved position, 
any two corners, corner 2 and corner ) say, can be twisted so that corner 
2 has orientation 1, corner 7 has orientation 2, and all other corners have 
orientation 0. Call such a move e;,;. Example: (R7!* D? * R* B7!*U? * By? 
1S €9,7. 

Let y = a, * €1,g+...+ 47 * 78, where a; € {0,1,2}. This is a move of the 
2x2 Rubik’s cube of the form (v, 1) - in other words, it permutes nothing but 
may twist some corners. By construction, all moves of this form are legal. 
For each a; * €;, there are three different possible positions (independent of 
all other a; * €;,3). Since there three choices for each a;, there are a total of 
3° distinct moves of the form y as above. 
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On the other hand, there are exactly 3° possible moves of the form (v, 1) 
which satisfy the conservation of twists. (proof: If v = (w%,...,08), v7 € 
{0,1,2}, and v4. +...+ vg = 0 (mod 3), then there are 3 ways to choose each 
of v1,..,v7 but then once these are fixed the consevation of twists condition 
leaves no choice for vg. This leaves a total of 3° choices.) These 37 possible 
moves include, of course, the legal moves of the form y above. Thus every 
move of the form (1,v), with v satisfying conservation of twists, is legal. 


Case 2: Assume v = 0 and r is arbitrary. Recall Sg is generated by the 
two-cycles (see chapter 3 above, §$3.3-3.4). 


Claim 1: Given any pair of corners, there is a 2-cycle move which swaps 
them. (Example: F-' * U * Bx U7! * Fx U? * Bol * U * Bx U? * Bot). 
Once two corners have been swapped, you may correct the orientation of any 
sub-cube by Case 1. Thus any permutation which preserves orientations is a 
legal moves. 


Case 3: Assume v and r are both arbitrary but satisfying conservation of 
twists. By case 2, we may make a legal move that changes (v,1r) to (v,1). 
By case 1, (v,1) is a legal move. 


PART 2: In this part, we show that any legal move satisfies conservation 
of twists. 


Assume (v,r) € C3 x Sy is a legal move. 


Define the length of a move g € G to be the smallest number n of gener- 
ators needed to create the move, written length(g) =n. 


Induction hypothesis: If a move is length n, it satisfies conservation of 
twists. 


step n = 1: Every v(2) where x € {R,L,U,D,F, B} satisfies the conser- 
vation of twists. 


step n > 1: Assume the induction hypothesis is true for all lengths < n—1. 
Let x be length n and write x = 21 * x2, where length(a1) <n —1 and 
length(x2) = 1. Then v(x) = %(21) + p(v1)0(v2), by the group operation. 
Furthermore, v(21) satisfies conservation of twists by the induction hypothe- 
sis. Since p(2,) simply permutes the coordinates of U(x), (x2) still satisfies 
the conservation of twists. The sum of moves satisfying conservation of twists 
still satisfies conservation of twists. 


Conclusion: by induction, any move (v,r) € C3 x Sy that is a legal move 
satisfies the conservation of twists. 
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11.3. On the group structure of the pyraminx 


The results of this section were worked out in A. Luers [Lul. 
Notation: Let 


V denote the vertices of the tetrahedron (which we identify with the 
set of corner pieces of the pyraminx), 


E denote the edges of the tetrahedron (which we identify with the set 
of edge pieces of the pyraminx), 


C’ the set of interior pieces of the tetrahedon (ie, movable pieces of the 
pyraminx not in FE or V), 


Sy the permutation group of V, 
Ay the alternating group of V, 
Sg the permutation group of F, 


Ag the alternating group of FE. 


Assume that the tetrahedron is lying on a flat surface in front of you, 


with the triangle base pointing away from you. The corners are denoted L 
(left), R (right), U (wp), and B (back). 


Basic Moves: Opposite each corner or vertex there are three layers: the 


tip, the middle layer, and the opposite face. Let 


e / denote the 120 degree clockwise rotation of the tip containing the left 


corner, 


L denote the 120 degree clockwise rotation of the tip/middle layer 
containing the left corner, 


r denote the 120 degree clockwise rotation of the tip containing the 
right corner, 


R denote the 120 degree clockwise rotation of the tip/middle layer 
containing the right corner, 


u denote the 120 degree clockwise rotation of the tip containing the up 
corner, 
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e U denote the 120 degree clockwise rotation of the tip/middle layer 
containing the up corner, 


e b denote the 120 degree clockwise rotation of the tip containing the 
back corner, 


e B denote the 120 degree clockwise rotation of the tip/middle layer 
containing the back corner. 


Let G=< R,L,U,B,r,l,u,b > denote the pyraminx group. 

Each move g € G induces a permutation of EF denoted o(g). Note that 
G does not permute the vertices. Furthermore, the tip moves r,/, u,b do not 
effect the edges. 


Lemma 211 0: G— Sz is a group homomorphism. 


Example 212 p(L) is a 3-cycle in Sy, o(L) is a 3-cycle in Sp. 


11.3.1 Orientations 


Assume for the moment that the pyraminx is fixed in space as above and is in 
the ’solved” position. For each corner or edge piece, choose once and for all 
one facet on that piece. There are three possible choices for each corner piece 
and two for the edges. Mark each of these choosen facets with an imaginary 
+’, leaving the other facets unmarked. For the rest of this section, we 
shall make the following choices for the marked facets (with reference to the 
numbering in 84.7 ): 


e marked edge facets: 4,6, 10,15, 20, 25 


e marked corner facets: 1, 13,17, 23 


For each edge piece, assign to a move g € G either 


e a’0’ if the ’+ facet’ for that piece when it was in the solved position is 
sent to the ’+ facet’ for that piece when it was in the present position, 
s] 


e a’l’ otherwise, 


This yields a 6-tuple of 0’s and 1’s: W(g) = (wy, wa, ..., We). 
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Example 213 We compute the effect of the basic twist moves on the edge 
orientations: 


w(X) 
(0,0,0,1,0,1) 
(0,0,1,0,1,0) 
(1,0, 1,0,0,0) 
(0,1,0,1,0,0) 
RxU-* RU | (1,1,0,0,0,0) 


Th by | 


For each corner piece, assign to a move g € G either 


e a0’ if the ’+ facet’ for that piece when it was in the solved position is 
sent to the ’+ facet’ for that piece in the present position, 


e a’l’ if the ’+ facet’ for that piece when it was in the solved position is 
sent to the facet which is a 120 degrees rotation about its vertex from 
the ’+ facet’ for that piece in the present position, 


e a 2’ otherwise, thus yielding a 4-tuple of 0’s, 1’s, and 2’s: v(g) = 
(v1, V2, 03, V4). 


Example 214 We compute the effect of the basic twist moves on the corner 
orientations: 


X] aX) 

B | (0,0,0,2) 
R | (0,0,2,0) 
L | (0,2,0,0) 
U | (2,0,0,0) 


Proposition 215 /f wW(g) = (wi, wo2,...,wWe) corresponds to a moveg EG 
then 


wy +we+...+ we = 0(mod 2). 


Observation: There is no corresponding condition for the v1, ..., v4, 
since corner moves move them around freely. 

proof: The proof uses the following lemma, but is otherwise essentially 
the same as the corresponding fact (Theorem 202 (c)) which we proved for 
the Rubik’s cube. The modifications required for the proof are left to the 
student as an exercise to test their understanding of the argument. 
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Lemma 216 Forg,h € G, we have 
B(g *h) = o(g)*(W(h)) + Wg): 


proof: The proof is essentially the same as the corresponding fact (Lemma 
199) which we proved for the Rubik’s cube. The modifications required for 
the proof are left to the reader as an exercise. 


Lemma 217 For g,h € G, we have 
B(g *h) = v(h) + vy). 
Exercise 11.3.1 Prove this lemma. 


Let H denote the enlarged pyraminx group generated by G and the ”il- 
legal edge moves” (that is, one may physically remove the edge pieces and 
reassemble the pyraminx. [legal center or corner moves are not allowed in 


A. Let 


H* = {(s,v,y) |r € Sy,s € Sg,x € Ch,y € C$} 
and define « : H* x H* — H* by 
(s,z,y) *(s),2',y') =(s*s,r+a',s(y)+y’). 
Theorem 218 e H* is, with this operation, a group. 
e There is are isomorphisms 
H = H* =C}x (Cf ><Sz), 


and hence between H and the direct product of the tip moves C3 and 
the wreath product 
C3 x (Sg wr Co). 
e The map G > H* defined by 


> 


g+— ((9), Hg), B(g)), 


is a homomorphism. 
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11.3.2 Center pieces 


Each corner piece has 3 center pieces neighboring it. 
Facts: 


e These center pieces, in the middle layer down from the corner and never 
be moved into any other corner’s middle layer. 


e The center pieces associated to a corner can never be moved into a 
middle layer associated to another corner. 


e The center pieces associated to a corner can always be color-aligned 
with the colors of the corner piece by a corner twist move. 


The third part says, in other words, that the center pieces can always be 


”solved” by a corner piece. 


11.3.3. The group structure 


Theorem 219 G is isomorphic to 


{(s,2,y) € H* | s even,y, + yo +...+ ye = 0(mod 2)}. 


The idea to prove this is to show that 


Ag =< o(R),0(L),o(U),o(B) > 
G—- Ag, 
g +> o(9), 


is surjective, 


eG = {(w,...,we) € CP | wi +... + we = 0(mod 2)}, g > Wg), is 
surjective (as a map of sets). 
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Here’s the proof of the first point: We can label (as in $5.10 ) the edges 
1,2,...,6 so that the edges on the front face are 1,2,3, resp., where the 
fl edge is 1, the fr edge is 2, and the fd edge is 3 (here f, r, 1, and d 
denote the front face, right face, left face, and down face, resp.). The move 
[R,U~'] = Rx U~' * Ro! *U is the counterclockwise cyclic permutation 
(1,3,2) of the edges on the f face. (This move does not affect any corners 
but does flip some edges, a fact which we may ignore for now since we are 
only concerned with the permutations now.) In particular, (1,3,2) may be 
written as a product of the generators in {o(R),o(L),a(U), o( B)}. Now pick 
any 2 € {4,5,6} and let s € G denote a move which sends edge 7 to edge 2 
and does not move edge 1 or 3. The move s * [R,U~*] * s~' is the 3-cycle 
(1,3,2). It does not affect any corners or other edges. By Lemma 160, these 


permutations generate Ag = Ag. 


The second point follows immediately from the first point proven above. 
Here’s the proof of the last point: The move g = [R, U~'] has the following 
effect on the orientation: w(g) = (1,1,0,0,0,0). The group 


{(21, 1, We) € CS | Wy +e We = 0(mod 2)} ~ C3 


is a vector space over Fj. The 5 vectors listed in the table for the values for 
w are all independent. It follows from this and the group law for G' proves 
that the map g +> w(g) is surjective. 

The theorem 219 above is thus proven. 


11.4. A uniform approach 


This section shall follow [GT] in a uniform discussion of the pyraminx, the 
3 x 3 Rubik’s cube, and the megaminx. Other puzzle groups are analyzed in 
[GT] (see also [B], chapter 2, [NST], chapter 19). 

Notation: Let 


e G, (resp., Grp, G,,) denote the permutation puzzle group generated by 
the basic moves of the pyraminx (resp., the Rubik’s cube, megaminx), 


e V, (resp., Vr, Vn) denote the set of vertex pieces of the pyraminx (resp., 
the Rubik’s cube, megaminx), 


e E, (resp., Er, Em) denote the set of edge pieces of the pyraminx (resp., 
the Rubik’s cube, megaminx), 
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e F, (resp., Fr, F,) denote the set of facets of the movable pieces of the 
pyraminx (resp., the Rubik’s cube, megaminx). 


11.4.1 General remarks 


Let G,V,E,F (resp.) denote either G,, E,, Vp, F (vesp.), or Gr, Er, Vr, Fr 
(resp.), or Gin, Em, Viny Fim (tesp.). 


Lemma 220 G acts on the set V, (resp., E, F). 


If g is any move in G then, since g acts on the sets V, F, and fF’, we may 
regard g 


e as an element of the symmetric group Sy of V, 
e as an element of the symmetric group Sg of FE, or 
e as an element of the symmetric group Sp of F. 


These groups Sy, Sg, and Sp are different, so to distinguish these three 
ways of regarding g, let us write 


e gv for the element of Sy corresponding to g, 
e gg for the element of Sg corresponding to g, 
e gr for the element of Sp corresponding to g. 


What is the kernel of fy? What is its image? To answer this question 
(actually, we shall not answer this precise question but one similar to it) we 
introduce a certain subgroup of the symmetric group. 

Recall the alternating group Ax is the subgroup of all even permutations 
of X (in the sense of Example 82 above). 


11.4.2 Parity conditions 


Consider the function 


fve:G— Sy X Sz 
g— (9v,9z) 


It is easy to check that this is a homomorphism. 
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Theorem 221 The image fyz(G) of fvg is isomorphic to 


Ay X Ag, for the pyraminx, megaminx, 


{ {(z,y) € Sy x Sg | x,y both even or both odd}, for Rubik's cube. 


This is a consequence of a result proven below in the case of the Rubik’s 
cube. To see what this theorem means, we look at an example. 


Example 222 Let G=Gp. 

Question: Can you find a move of the Rubik’s cube which flips a single 
edge subcube over, leaving the rest of the puzzle pieces unmoved? 

If so, then the image of fry would have to contain an element (x,y) with 
x = 1 (since moving an edge only does not effect the vertices) and where y 
is a 2-cycle. But x = 1 is even and a 2-cycle is odd. This contradicts the 
theorem, which says that x,y are either both even or both odd. Therefore, the 
answer is no: a single edge flip is impossible. 


Next, some more notation: let 
K= ker(fve) dG 


denote the kernel of the map fyg introduced above. This is a normal sub- 
group of G. 


Example 223 In the case of the Rubik’s cube, this subgroup K is the set of 
moves which may reorient (i.e., flip or rotate) a subcube but does not swap 
it with some other subcube. For example, the move 


(Rol * D? * R* Bo! *U? * BY’, 


which twists the ufr corner clockwise and the bld corner counterclockwise, 
belongs to K. 


Theorem 224 (Gold, Turner [GT]) G is a semi-direct product of K with 
fve(G). 


This is a consequence of a result proven above in the case of the Rubik’s 
cube. In the case of the 3 x 3 Rubik’s cube, some more details are given in 


chapter 10. See also [GT] or [NST], chapter 19. 
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Chapter 12 


Interesting subgroups of the 
cube group 


” (Lefschetz and Einstein] had a running debate for many years. 
Lefschetz insisted that there was difficult mathematics. Einstein 
said that there was no difficult mathematics, only stupid mathe- 
maticians. I think that the history of mathematics is on the side 
of Einstein.” 


Richard Bellman 
EYE OF THE HURRICANE, 1984 


It is remarkable that several familiar” groups may be embedding into 
the Rubik’s cube group, and hence be regarded as a subgroup of the cube 
group. For example, we have seen in an earlier chapter how to embed the 
group of quaternions Q = {1,—1,7,-7,j,--j, k, —k} inside the Rubik’s cube 
group. 

The subgroup method, discussed in the appendix, is a method for inves- 
tigating God’s algorithm using a computer. One of the groups arising in this 
method is the group studied in the first section of this chapter. The second 
section studies the ”two faces” group. 


225 
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12.1 The squares subgroup 


Let G denote the subgroup of the Rubik’s cube group generated by the 
squares of the basic moves: 


G :=< U*, D’, R’, 1’, F’, B’ > 


called the squares group. We shall verify below that the order of this group is 
2!83*. (By the way, as a consequence of this and Burnside’s theorem [R], ch. 
5, it follows that G is a solvable group. We shall not need this fact.) In this 
section, we will investigate the group struction of G using the same method 
which was used to determine the structure of the Rubik’s cube group. 

The group G acts on the set of edges and the set of vertices of the cube. 
There is a choice of orientation of the edges (resp., corners) similar to that 
in §§10.1.2-3 such that each element of G preserves the edge (resp., corner) 
orientations. 

The action ¢ of G on the edges EF of the cube has exactly 3 orbits: the 
middle slice parallel to the right face Ep, the middle slice parallel to the 
front face Ey, the middle slice parallel to the up face Ey. In particular, 
the group G acts (by restriction) on Ep, Er, and Ey. The action ~ of 
G on the set of vertices V has exactly 2 orbits: Vj = {ufr,ubl,dfl,drb}, 
VY, = {ufl,ubr,dfr,dlb}. Therfore, the group G acts (by restriction) on Vi 


and V2. These actions yield associated homomorphisms: 


o:G— Sz, 
OBp:G— SEp, 
dp, 1G Sep, 
Oky 1G Sz, 

o:G— sy, 
Wy: G— Sy, 
wo: G— Sy,. 


Proposition 225 G = 9¢(G) x W(G). 


The proof of this is left as an exercise (hint: use the second fundamental 
theorem of Rubik’s cube theory). 


Lemma 226 [fg € G then 


sgn( rp (g))s9n(GEp(9))s9N( Oxy (g)) = 1 
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Conversely, if (pi, P2,P3) © Sep X Sep X Su, then there is ag € G such that 


Pi = OER (9), P2 = Onp(G9), Ps = Ony(g) Uf and only if sgn(py)sgn(p2)sgn(p3) = 
1. 


As a consequence, we find that 


(G) = ker(sgn x sgn xX sgn: og,(G) X oz, (G) X dz, (G) — {+1}) 
= (54 x Sy x S4)/Co. 


In particular, |¢(G)| = (4!)°/2 = 283°. 

It remains to determine ~(G). We denote this group by H for nota- 
tional simplicity. We may label the vertices of the cube 1,2,...,8 in such a 
way that H =< u,d,l,r,f,b >, where u = (1,3)(2,4), f = (1,8)(4,5),d = 
(5,7)(6,8),6 = (3,6)(2,7),r = (2,5)(1,6), 2 = (4,7)(3,8). The action of H 
on the set of vertices of the cube has two orbits ({1,3,6,8} and {2,4,5, 7} 
in our labeling above), which we denote for simplicity by V; and V2. There 
are homomorphisms 


wm: Sy, wo2:H— Sy, 


but we shall not say much about these. Instead, we use GAP [Gap] to 
determine more about H. According to GAP, this group has |H| = 96 
elements and 10 conjugacy classes (by the way, GAP also says that all the 


generators u,...,b are conjugate): 
size representative 
1 1 
12 d=(5,7)(6,8) 
32 l*d=(3,6,8)(4,5,7) 
3 d*I*b*l= C v4) (5,7) 
12 d*b*1=(2,4,7,5)(3,6) 
3 I*b*P*u=(1,3)(6.8) 
12 b*1*u=(1,3,6,8)(4,7) 
3 | w*d=(1,3)(2,4)(5,7)(6,8) 
12 Hd*uc(LB6, 8)(2,4,5,7) 
6 | u*r*d*£=(1,3)(2,5)(4,7)(6,8) 


The stabilizer in H of any vertex v € V, written H,, is a subgroup of 
order 24 isomorphic to the symmetric group $4. 
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Furthermore, H has a normal subgroup N of order 48 (and index 2), 
where N is a semidirect product of C3 by C3, with C} normal in N. 

This is all we shall say about H. 

The order of G is therefore G = |¢(G)|- |H| = 96 - (4!)°/2 = 2'°3*, as 


claimed above. 


12.2. PGL(2,F;) and two faces of the cube 


The material in this section was communicated to me by Dan Bump (the idea 
originally arose in D. SIngmaster’s [Si]). This section is relatively advanced 
in that it requires more mathematical background from the reader than the 
previous chapters. 

This section is devoted to determining” the two-face group generated by 
only two basic moves, < F,U >. D. Singmaster [Si] has shown that 


<FLU>2 S7 xX PGL4( 5), 


where PGL2(Fs) is a group of order 120 which is defined below. Here $7 
arises from the action of the Rubik’s cube group on the edges and PG'L2(Fs) 
arises from the action on the corners. In this chapter, we focus on the action 


on the corners. 


12.2.1 Finite fields 


In this subsection, we introduce fields and especially finite fields. 


The general definition 


A field is a set F’ with an addition law + and a multiplication law - which 
obeys the a list of properties similar to those for the field of real numbers R. 


More precisely, we call (F,+,-) a field if 


(F1) (F,+) is an abelian group, with an identity element denoted 0 ("the 
additive group of the field”), 


(F2) for all xz, y,z € F, («+ y)z = x24 yz ("distributive law”), 


(F3) (F — {0},-) is an abelian group, with an identity element denoted 1 
(“the multiplicative group of the field”). 
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It happens to be true that if F' is a finite field then not only is (F'— {0}, -) 
an abelian group, it is actually a cyclic group. 


Definition 227 Let F,, Fy be wo fields. A function f : Fy, — Fy is called a 
field isomorphism if 

(a) when f is restricted to the additive group (F\,+), call this restriction 
f again, it yields an isomorphism of groups f : (Fi, +) > (Fo,4), 

(b) when f is restricted to the multiplicative group (F\ — {0},-), call this 
restriction f again, it yields an isomorphism of groups f : (Ff, — {0},-) > 


(Fy _ {0}, ‘). 


I 


A construction of F, 


Recall from §2.3 that congruence modulo n (n > 1 an integer) is an equiv- 
alence relation. Let n = p be a prime and let & denote the equivalence 
class of k with respect to this equivalence relation. Let F, denote the 


finite field with p elements, so F,, is, as a set, 


*, = {0,1,...,p— 1}, 


with addition and multiplication being performed mod p. 


Example 228 When p= 5, Fs will denote the finite field with 5 elements, 
so Fs is, as a set, 


"s = {0, 1,2,3,4}, 


with addition and multiplication being performed mod 5. 


It is a general fact that if F is any finite field then there is a prime 
number p such that px = 0 for all « € F. This prime number is called the 
characteristic of Ff. The easiest example of a finite field with characteristic 


p is the finite field having p elements, F,. It is not hard to see that any 
other finite field F' of characteristic p must be a finite dimensional vector 
space over F,. (Even if you’ve never seen a ”vector space over F,,” defined 
before, if you know what a ”real vector space” is then you’ve got the right 
idea.) The dimension of the vector space F is called the degree of F' over F,,, 
denoted d= [F': F,], and F is called a field extension of F,, of degree d. It is 
a general fact that for fixed p,d there is, up to isomorphism, only one such 
field. 


Next, we shall show how to construct in a very simple way such extensions. 
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A construction of finite fields 


First, some general remarks. Since F is a finite dimensional vector space 


containing F,,, it has a vector space basis which we label as 
€y = 1,€2,..., €g- 


Thus F is, as a set, the collection of elements of the form 


I 


LC, + ...Ln€n, ve hy. 


Since F is a field, there are cf € F,, which we call structure constants, such 


that 
n 
N k 
eje; = Ciy€k: 
k=1 


There are d* € F,, which we call inversion constants, such that 


n 
-1 _ y k 
k=1 


These constants determine how to multiply and divide elements of F'. We 
shall consider Ff’ ”completely described” once we explicitly determine these 
constants. 


Example 229 Let p= 5, so Fs = {0,1,2,3,4}. The set of squares is given 
by 


fx? rE ‘st = {0, 1,4}. 
In particular, 2,3 are not squares in this field. Let e, = V2 be a formal symbol 
for some element which satisfies e3 = 2. This is a root of the polynomial 
z?—2=0. 
The vector space F over Fs with basis {e, = 1,e2} is 2-dimensional over 


's. Two elements x1e1 + 22€2 = 21 + ayV2 and y,e, + y2e2 = yr + yoV2 are 
multiplied by the rule 


(21+ ryV2) ‘(y+ y2V2) = a11yi + 2xoy2 + (Liye + ya) V2. 


It is a degree 2 field extension since 


and 
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The construction used in the above example may be summarized more 
generally as follows: 


1. Pick an element m € F, which is not the square of another element. 


2. Let ey = 1 and eg = Vm be a formal symbol for some element which 
satisfies e} = ™. 


3. Asa set, let F = {xe, + yeo | x,y € F,}. To define F as a field, let + 
be ”componentwise addition” mod p and let - be defined by 


(a1 + voVm) “(yi t+ yoV im) = 2141 + Margy2 + (X1y2 + yite)V Ti. 


A finite field F’ constructed in this way is called a quadratic extension of 


I 


P 


More generally, let d > 1 be an integer. 


1. Pick an element m € F, which is not the d‘” power of another element. 


2. Let e, = 1, let eg = m'/¢ be a formal symbol for some element which 
satisfies ef =m, and (if d > 2) let e; = ef)? for? = 3,...,d. 


3. As a set, let F = {aje, +... 4+ ageq | a; € F,}. To define F asa 
field, let + be ”componentwise addition” mod p and let - be defined by 
expanding and collecting (a1¢1 +... + vaea)- (ye +... + Yala). 


A finite field F’ constructed in this way is called a degree d extension of 


r,. It has p? elements. It turns out that any two fields having p’ elements 
must be isomorphic. Therefore, any finite field must be isomorphic to one 
described above. 


Definition 230 The projective plane 


P*( ) = {0,1,...,p ~~ 1, co} 


is defined to be the set of lines through the origin in the Cartesian plane F? 


p? 
associating each number (including co) with the slope of the corresponding 


line. 
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12.2.2. Mobius transformations 


If a,b,c,d € 


Mobius transformation f by: 


fr: PY 


wr 


", are given numbers (not all equal to zero) then we define the 


‘) Pp ( 
ax+b 


ca+d* 


"») 


a b 


Theorem 231 f is a bijection if and only if det ( od 


Js 


Before proving this, we need the following 


Definition 232 Define 


; b 
=((" :) | a,b,c,d € 


This set is a group under ordinary matrix multiplication and, furthermore, 
acts on the set P'(Fs) by means of Mobius transformations thus defining a 
function 


GL(2, 


';, ad— be #0}. 


's) a P'( 


‘) 


Lemma 233 (a) The center of GL(2, 


") (ue., the subgroup of all elements 


which commute with every element in GL(2, 


\)) ts given by 


Z(GL(2, 


m=t(G 2) lee 


(b) This subgroup is normal in GL(2,F,). 


(c) There is an isomorphism 


Z(GL(2,F 


)Ca)=( 


proof: (a) Since 


r 0 
0 r 


r 0 
0 r 


"na FOF. 


) 


a ob 
c ad 
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we can conclude that 


(5 ) |a€F,, a £0} C Z(GL(2,F,)). 


To show that 


ZGLOB) C{( 5 9) [ae Bn AO) 


Cee )Coad= Go) a) 


for all a,6,c,d. This implies bu = cs for all 6,c. This is impossible unless 
u=s=0. This in turn forces cr = cv, for all c. This implies r = v. This 
proves the desired inclusion. 

The proof of parts (b) and (c) are left as an exercise for the reader. 


assume that 


Definition 234 The quotient group, denoted PGL(2,F,) = GL(2,F,)/Z2(GLQ,F 


is called the projective linear group. (This is a group since the center is a 
normal subgroup by the lemma above.) 


Lemma 235 This group PGL(2,F,) acts on the set P'(F,) by means of the 
linear fractional transformations. 


Remark 24 In fact, the action of PGL(2,F,) on the set P'(F,,) ts 3-transitive. 
This not hard to prove but we left it to the interested reader to look it up in 
[R] (see Theorem 9.48). 


proof: First, we show that the GL(2,F,,) acts on the set P'(F,,) by means 
of the linear fractional transformations. In other words, if 


a ax+6 
a(' = 


(a) ( ; , ) (x) = «x, for all x (i.e., the linear fractional transformation 


@ ( ; , ) is the identity map), 
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(b) 6(A) 0 6(B) = ¢(AB), for all A, B € GL(2,F,). 


We leave (a) to the reader and check (b). Let A = ( : y 


(" * ). then 
uioov 


sO 


) and B= 


— f{ ar+bu as + bv 
ap= (0 er ein) 


(ar + bu)a + as + bv 


o(AB)(x) = (cr + du)e +es + du 


On the other hand, ¢(A)(¢(B)()) is equal to 


re bs af tats b 
(ay ZE*) = eet 


ULTU 


Simplifying this, we see that the last two displayed equations are equal. This 
verifies (b). 

Therefore, GL(2,F,) acts on the set P!(F,). 

Let Z = Z(GL(2,F,)). Since ¢ ( : ¥ ) =o ( - " ). for all non- 
zero r, it follows that we may define an action of PGL(2,F,) on the set 


P'(F,) by ¢(A- Z) = o(A), for all A € GL(2,F,,). 


proof of the theorem: Let ¢ be as above and let f be as in the statement 
of the theorem. 

(<=): Since GL(2,F,,) acts on the set P'(F,), we have 1 = ¢(AA7') = 
¢(A)¢(A™"), so 6(A) is invertible. This implies that f is a bijection. 

(=): We prove the contrapositive. Suppose that det ( : ¥ 
result in linear algebra (see any text book, for example [JN]), the row vectors 
of this matrix are linearly dependent. This implies that there is an r € F, 
such that either (a,6) = r- (c,d) or (c,d) = r- (a,b). In either case, the 

az+b 


quotient f(2) = 25) is a constant independent of x, so cannot be surjective. 


CL 


This proves that f is not a bijection, which verifies the contrapositive. 


= (0. Bya 
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12.2.3 The main isomorphism 


Let G denote the Rubik’s cube group. Let H be the subgroup generated by 
F and U: 
H=<FfU>. 


Exercise 12.2.1 Show the group H acts on the set of vertices above (via the 
Rubik’s cube group). 


We describe how to label the six vertices on the “up” and ”front” faces 
of the cube, 
fru, flu, dfl, dfr, bru, blu, 


with the elements in the projective plane 


P'(Fs) = {0,1,2,3,4, co} 


in a certain way. More precisely, we will show label the 6 vertices above with 
elements of P'(Fs) in such a way that (a), (b) of the following theorem hold 
true. 


Theorem 236 There are ao,44, bo, 61,0, C1, d0,d1 € Fs (given explicitly be- 
low) such that 

(a) the action of F (the usual rotation of the front face) on these vertices 
is the same as the action of some linear fractional transformation 


age + bo 
Cot + do 


(b) the action of U (the usual rotation of the up face) on these vertices is 
the same as the action of some linear fractional transformation 


fr(x) = 


a,x + by 


fu(x) = cxetd, 


In other words, the basic moves F’, U may be regarded as linear fractions 
transformations over a finite field! 


Theorem 237 PGL(2,Fs) =< fr, fu >. 


Remark 25 PGL(2,Fs) is isomorphic to Ss (this is part of Exercise 9.25 


We shall prove these below. 
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12.2.4 The labeling 


Label the up and front vertices as (see Fig. 12.1) 


0 4 
R 
F 
1 oe 
Figure 12.1: 
Let 
fr(o) ==, fol) = 8043 
es aan ULL) = of ; 
The map 


eo: Ft fp, Uroky, 


extends to a surjective homomorphism of groups 


@:<FU> > <frfu>c PGL(2, 5). 


Exercise 12.2.2 Verify the first theorem above. 


12.2.5 Proof of the second theorem 
Let 


a b ‘ 
(68) epounen 
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denote the image of 


under the natural map GL(2,Fs) ~ PGL(2,Fs), g —>3 FF * g. 


Since 
2 {90 -l 
UT\ 1 0 . 
we have 
0 —-l 
1 0 . e< fu, fr >. 
Since 


fexte=(= 2) 


Conjugating this matrix by f7, we find that 


(1 1) es fle: 


It is known that SZ(2,F;) is generated by elementary transvections (see 


[R]). Therefore, 


it follows that 


PSL(2,Fs) C< fr, fu >C PGL(2,Fs). 
It is also known (see [R]) that 
|PSL(2,Fs)|=60 and |PGL(2,F;)| = 120. 


It remains to show that there is an element of < fr, fy > which does 
not belong to PSL(2,F;). We claim that such an element is ft. Note that 
det( ft) belongs to the set 


3(FZ)° = {30° | « © FF}. 
But an element of PS'Z(2,F;) must have determinant 1. Since 37! = 2 (mod 


5) is not a square mod 5, there is no element of Fs; which satisfies 1 = 32°, 


Thus fy does not belong to PS'L(2,Fs). 


Exercise 12.2.3 As an application of theorem 237, use the example in sec- 
tion 5.4.2 to show that there exists an embedding Dig —~ PGL(2,Fs) of the 
symmetry group of the hexagon into PGL(2, Fs). 
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12.3 The cross groups 


Define a cross move of the cube to be a move of the form X * Y~1, where 
X,Y € {R,L,U,D,F,B}. The subgroup of the Rubik’s cube group gener- 
ated by the cross moves will be called the cross group. 

The cross moves permute the set V of vertices of the cube and there- 
fore generate a subgroup of Sy. This is called the vertex cross group. The 
cross moves permute the set F’ of edges of the cube and therefore generate a 
subgroup of Sz. This is called the edge cross group. 


All the enties in the following table are, as far as | am aware, are new 
except for the Mj. entry. 


Rubik polyhedra | edge cross group | vertex cross group 
tetrahedron As = PSL.2(Fs) Cy x Cy 
cube Aj PSL.2(F7) 
octahedron Aj PSL.(BFs) 
dodecahedron Aag Aoag 
icosahedron Aag Aye 
rubicon Azo Mie 


In fact, the subgroup of the dodecahedral edge cross group generated by 
a subset of the cross moves can yield (smaller but still simple) alternating 
groups. 

Problem: (F. Dyson) Work out the analogous cross groups of the ” Ru- 
bicized” 4-dimensional regular polyhedra. 

This is only known for the 4-dimensional Rubik’s hypercube. In the 4- 
dimensional case, one can also define the face cross group since the moves of 
the 4-dimensional Rubik’s hypercube also permute the set F' of 2-dimensional 
faces. We have the following results: 


Rubik polyhedra | edge cross group | vertex cross group | face cross group | 


4-dimensional cube Ar Ay Ar 


Problem: Are any of the analogous cross groups of the ” Rubicized” 
3-dimensional Archimedian polyhedra simple? 

It appears that the vertex cross group of the regular truncated cube is 
not simple. 
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12.3.1 PSL(2,F;) and crossing the cube 


Let C denote the vertex cross group of the Rubik’s cube. 


Theorem 238 C = PSL,(F7). 


The first proof of this is by computer! 

first proof: Gap [Gap] gives that C is a simple group of order 168. By 
the classification of simple groups (or, more simply, Exercise 9.26 in [R]), C 
must be isomorphic to PSL2(F7). 

The second proof is from [CD]. 

second proof: PS L2(F7) can be generated by the three matrices: 


na(1 2 )ae(3 1)e(82): 


We will label the vertices of the cube in the following manner: (see Fig. 
12.2) 


Figure 12.2: Labeling the cube by the projective line P'(F7) 


Under this labeling, we can show that 
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e the image of the move m, = (UD~')* will permute the vertices in 
this way: (oo, 0)(1,6)(2,3)(4,5), The same permutation is given by the 
mobius transform (0a — 1)/(« +0) acting on Pi ‘7). 


e mz = UR" gives us the permutation: (0,1,3)(2,5,4), which is given 
by (22 +1)/(Oxr + 1) acting of P!'(F7). 


e m3 = BU'LB™ gives us the permutation (1, 2,4)(3,6,5), which is 
given by (2x + 0)/(0z + 1) acting on P!(F;). 


You should notice that if the constants in these Mobius transformations 
(a, b,c,d) are written in matrix form, they correspond to the generators of 
PSL.(F7). Now we will define a homomorphism q : C — PSLI,(F7), such 
that g(m1) = fi, q(me2) = fo, q(m3) = fz. We want to show that our q is an 


isomorphism. 
To do this we will first show that it is surjective. Let f be a matrix in 


PSL.2(F7), which can be written as a product of generators fi, fo, fs (where 
g(m1) = fi,g(m2) = fe,g(ms) = fs). Now take f as some element of 
PSL.(F7). f can be broken down as a product of its generators, f1, fo, fs, 


we'll say 
k=1 


Since we have a homomorphism, we can write it as a product of the images of 
the generators of C. Again we can rewrite it as f = q([[,;_, m;"). Therefore 


q is surjective. 


To show that ¢ is one to one we need to know that PSZ2(F7) has order 
168 [R], and that the order of the cross group is also 168. (This fact was 
proven by computer.) We will prove by contradiction that q is one to one. 

Now we assume that c; and cy are elements of C’, such that q(c1) = ¢q(ca), 
and c, is not equal to c. |PSL.(F7)| = |q(C)|. We now subtract cy from 
C , and |¢(C)| = |¢(C — c2)| because g(c1) = ¢g(c2). Now we can say that 
lq(C — €2)| < or = |C — c2| because we know that q is surjective. 


Since we have taken cy out of C, we know |C — @| < |C|, which by 
transitivity implies |PSL2(F7)| < |C|. This is a contradiction because we 
know |PSLI2(F7)| = |C|. Therefore g is injective. 

Now that we have shown that q is both surjective and injective, it is 


bijective and an isomorphism. 
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The above proof of the theorem tells us explicitly that there exists a 
labeling of the vertices V of the cube by the elements of the projective line 


P'(F) = {oo, 0,1, 2,3, 4,5, 6}, 


with the property that there is a move c: V — V in C if and only if there is 
a Mobius transformation f : P'(F) > P!(F) in PSL2(F7). 

Because the group of Mobius transformations in PS L2(F7) acts 2-transitively 
on the projective line P'(F) (see [R], Theorem 9.45), it follows that we have 
the following 


Corollary 239 C' acts 2-transitively on V. In other words, for any ordered 
pairs (v1, 02), (v}, v2) of distinct vertices there is an element c € C sending 
v; to vi, fort =1,2. 


12.3.2 Klein’s 4-group and crossing the pyraminx 


We leave the main result of this section as an Exercise - actually more of a 
project - for the reader. 


Exercise 12.3.1 (hard) Show that the subgroup of Sy generated by the twist- 
untwist moves of the pyraminz is isomorphic to the Klein 4 group Cy x Cd. 


The determination of the cross group for the megaminx is due to J. Con- 
way. It will be presented in the next chapter. 
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Chapter 13 


Crossing the Rubicon 


”Mathematical structures are among the most beautiful dis- 
coveries by the human mind. The best of these discoveries have 
tremendous metaphorical and explanatory power.” 

Douglas Hofstadter 

METAMATHEMATICAL THEMAS, 1985 


Much of the material here can be found in [CS] and is due to John Conway. 
The title of this chapter is, however, ” borrowed” from a similarly worded title 
of an article by D. Hofstadter [H]. More details on parts of this chapter may 
be found in Ann Luers’ paper [Lu]. 

This chapter shall be a little more advanced than some of the others. The 
reader will be assumed to be familiar with some topics covered in a course 
in linear algebra and elementary number theory or coding theory. We shall 
also assume some results from Rotman [R], though for the understanding of 
the material in this chapter the reader may simply take them on faith. 

Let g1,.--;912 denote the basic moves of the Rubik isocahedron. A sur- 
prising result of Conway states that the group generated by g; * g;* is the 
simple ” sporadic” group Mj2. (This is stated more precisely below.) We shall 
describe, in this chapter, what M2 is and some of its remarkable properties. 
They form a basis for my opinion, which I hope you will agree with, that 
Mj, is one of the most interesting objects in mathematics. 

Let p be a prime unless otherwise stated and let g be a power of p. We 
shall assume that p> 3. 
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13.1 Doing the Mongean shuffle 


Consider a deck of 12 cards labeled 0, 1, ..., 11. Let r,s be the permutations 
r(t) = 11 -#, 8(t) = min(2t, 23 — 2t). 


The permutation r reverses the cards around and the permutation s is called 
the ”Mongean shuffle”. To perform the reverse shuffle, simply take a stack 
of cards (face down, say) in your left hand and put them in your right hand 
one-at-a-time (face down). To perform the Mongean shuffle, take the same 
stack of cards and, one-at-a-time, put them alternately into one of two piles: 
the first card face up into the first pile, the second card face down into the 
second pile, the third card face up into the first pile, the fourth card face 
down into the second pile, and so on until the pile is exhausted. Now pick 
up the first pile of face up cards, flip the entire pile over so that they are all 
fade down and put it on top of the second pile. 


cards | reverse shuffle | Mongean shuffle 
0 11 0 
1 10 2 
2 9 4 
3 8 6 
4 7 8 
5 6 10 
6 5 11 
7 4 9 
8 3 7 
9 2 5 
10 1 3 
11 0 1 


Definition 240 The Mathieu group Mj, is defined to be the permutation 
group Miz =< r,s >< Syo. 


13.2 Background on PSL» 


We need a few basic facts about the projective special linear group of degree 
2. We have already discussed the related group GL(2,F,) in the previous 
chapter, so we refer to there for more details. 


13.2. BACKGROUND ON PSL, 


Definition 241 (1° version) Define SLo( 
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‘,) to be the group of all 2 x 2 


matrices having entries taken from the finite field F, and having determi- 


nant one. This is called the special linear group of degree 2 over F,. The 


center of this group, denoted Z, is the subgroup of 2 x 2 “scalar” matrices 


of the form diag(z,z), where z € {1,-1}. 


(This is a normal subgroup of 


SL(F,).) Define PSL2(F,) to be the quotient SL2(F,)/Z. This is called the 


projective special linear group of degree 2 over F,. 


Definition 242 (2 version) Define the projective line P'(F,) of the finite 
field F, to be the q+1 values of the formal ratio x/y, where x,y run over all 
elements of F,. If y = 0 then we denote this formal value by oo, so 


P*( "7) =F, U {oo}. 


Ifq=p is a prime then we denote 


P'(F,) = {00, 0,1)... 


Define PSL2(F,) to be the group of all Mobius transformations on the pro- 


jective line 


f(x) = (ax + b)/(cx + d), 


LE P*( a), 


where ad — be = 1 and a,b,c,d € F, (We define f(oo) = a/c.) 


Mobius transformations are bijections from the projective line to itself, 


so we may interpret each Mobius transformation as an element of Sy , where 


X = P'(F,) (and therefore also of S,,, where n = |P'(F,)| = q+ 1). 
Example 243 Let p= 11 and let f(x) = —1/2. Then 


oe 

— 
8 

—* 


RoemraA MK wwmK SD BA 
MAK WSWARaESES 
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Therefore, as a permutation, f = (00, 0)(1, 10)(2, 5)(3, 7)(4,8)(6, 9). 


The following facts are known about the projective special linear group: 


Theorem 244 [fq > 3 then PSL2(F,) is a simple group. Moreover, for all 
prime powers q, 


|PSL(F,)| = (q? — 1)q/ged(2,q — 1). 


(Recall a simple group was defined in Definition 163.) This theorem is 
over 100 years old. It is proven, for example, in [R]. 


Theorem 245 Choose ak € F, such that << k >= ie Let 


fife)=a41, fola)=k-x, — fa(x) = —-1/z. 


Then PSL.(F,) is generated by fi, fo, and fs. In particular, the action of 
PSL.(F,) on the projective line X = P'(F,) yields an injective homomor- 
phism PSLI2(F,) > Sx. 


Basically, this is proven in [R] as well. 


13.3. Galois’ last dream 


Supposedly, the night before he died in a duel, Galois wrote a letter to a 
friend stating the following remarkable theorem: 


Theorem 246 (Galois) Assume p> 11. Then PSL2(F,) has no embedding 
into a symmetric group S, with n <p. 


The following isomorphisms (for g < 11) are known: 


Ag, q= 3, 
PSL4( s) > As, q= 5, 
Ag, q= 9 


If p = 7 or p = 11 then explicit embeddings of PSL2(F,) into Ag (p = 7), 
Aj (p = 11) are known (see [CS], ch 10, or [K] for an excellent discussion of 
this). 
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13.4 The M). generation 


One of the most amazing aspects about Mj». is its close relationship with 
other ”interesting” groups. 


Definition 247 Define the permutation fy of the set P'(F,,) = {00, 0,1,...,p— 
1}, for3 <p <11, as follows 


I, p=3, 

fa= (1,2)(3, 4), P=9%,; 
, (1, 2)(3, 6), p=, 
(2, 10)(3, 4)(5, 9) (6, 7), p=ll 


We have run across the group S¢ before, when studying the symmetries 
of the icosahedron. We have also seen that Sg is rather an interesting group 
because it is the only non-abelian symmetric group 5, which has an outer 
automorphism. One rather connection between Mj. and S¢ is given by the 
following 


Theorem 248 (a) If p=5 then Sg =< fi, fo, fs, fa >. 
(b) [fp =11 then Miz =< fi, fo, fs, fa >. 


We shall see another interpretation of Mj. below using coding theory! 


Definition 249 Let 


_ x? /9, LE ( 3)? _ 0, 
00) = {98 pee) map 


This is an element of Sx, where X = P!(F23). Define the Mathieu group Moa 
by 


< f,5| f © PSLo(Bos) >. 


This is a permutation group in Sx, where X = P'(Fo3). 


By the way, 

(a) (Fo3)? = {0, 1, 2,3, 4,6, 8, 9, 12, 13, 16, 18}, 
(b) Miz =8-9- 10-11-12 = 95040, 

(c) |Mo4| = 244823040. 


o— 
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13.5 Coding the Golay way 


Codes are used in everyday life, from ISBN numbers on books to barcodes on 
food products to music CDs to satellite transmissions. There are many types 
of codes, some more efficient than others, some with better error correcting 
ability than others, some more practical than others, and so on. We shall 
concern ourselves only with aspects which are related to (in one way or 
another) permutation puzzles. 


Definition 250 A g-ary code is a subset C of a finite dimensional vector 
space V over the finite field F,. A code word is an element of C. The number 
of coordinates (i.e., the dimension of V) is called the length of the code word. 
If q = 2 then the code is called binary (instead of 2-ary) and if ¢ = 3 then 
the code is called ternary (instead of 3-ary). 


Example 251 V =F" =F, x... x F, (n times) is a code. 


Definition 252 Let Mon,(F,) denote the group of alln x n matrices which 
have exactly one non-zero entry from F, per row and per column. An element 
of Mon,(F,) is called a monomial matrix. 


Exercise 13.5.1 (a) Show Mon,(F,) is a group. 
(b) Show |Mon,(F,)| = (q¢— 1)" - nl. 


Definition 253 The set of all AG Mon,(F,) such that A*xC =C (i.e., are 
the same code) is called the automorphism group of C, denoted Aut(C). 


We shall see some examples below. 


Definition 254 [fw is a code word in F" then number of non-zero coordi- 
nates of w is called the weight of w, denoted wt(w). 


A cyclic code is a code which has the property that whenever (co, C1, ..-5n—1) 
is a code word then so ts (€n—1, Co, -+-Cn—2)+ Lf (Co, C1, +) Cn—1) ts a code word 
in a cyclic code V then we call 


g(x) =cotqat.. te 10"! 


a generator polynomial for C . 
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Example 255 The code with elements 
(1,1,0,1,0,0,0),(0,1,1,0,1,0,0),(0,0,1,1,0,1,0), (0,0, 0,1,1,0,1) 
is a binary cyclic code of length 7 and weight 3. It has generator polynomial 
g(x) =l4+er4r’. 


(This code is called a "Hamming code” and has many interesting properties 
which, to describe, would take us too far afield. The interested reader is 


refered to [CS], ch. 3.) 


Definition 256 Let n be a positive integer relatively prime to q and let alpha 
be a primitive n-th root of unity. Each generator polynomial g of a cyclic code 
C of length n has a factorization of the form 


g(x) = (a4 - a®)..(a — a’), 


where {ky,...,kp} C {0,...,n—1}. The numbers a®,1 <i <r, are called the 
zeros of the code C’. They do not depend on the choice of g. 


Definition 257 Let p and n be distinct primes and assume that p is a square 
modn. The quadratic residue code of length n over F,, is the cyclic code whose 


generator polynomial has zeros 
f{a* | k is a square mod n}. 


The binary Golay code GCo is the quadratic residue code of length 23 over 
‘5. The binary Golay code GC is the code of length 24 over Fz obtained by 
appending onto GC a zero-sum check digit. 

The ternary Golay code GC, is the quadratic residue code of length 11 
over F3. The ternary Golay code GC» is the code of length 12 over F3 ob- 


tained by appending onto GC, a zero-sum check digit. 


The following result illustrate how the Matheiu groups arise in coding 
theory. 


Theorem 258 (a) There is a normal subgroup N of Aut(GCi2) of order 2 
such that Aut(GCi2)/N is isomorphic to Mig. 
(b) Aut(G'Co4) = Mo,. 
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Since the Mathieu groups are so large, this theorem above indicates that 
the Golay codes GCj, and GCo4 have a lot of symmetry. 

It is a basic rule of thumb in mathematics that whenever you find some- 
thing displaying a lot of symmetry then it will quite often have other inter- 
esting properties. With this philosophy spurring us on, let us turn to some 
of the other properties of these codes. 


Lemma 259 Any two code words in GC 4 differ by 8 bits. The code GC 4 
detects 4 errors (per 24 bits) and corrects 3 errors. 


When you compare that with the correcting ability of bar-codes or ISBN 
codes (which have a check-digit), GC24 is much better. 


Lemma 260 [fw is a code word in GCy4 then wt(w) is either 0,8, 12,16, 24. 
Definition 261 The code words of weight 12 in GC 4 are called dodecads. 


We may identify a code word w = (€o, 1, ..-,€23) with the set of indices i 
of the non-zero coordinates ¢; # 0. 


Theorem 262 Mj» is the stabilizer in Mo4 of a dodecad, regarded as a set 
of indices. 


13.6 Mj. is crossing the rubicon 


The result of this section was mentioned briefly in §12.3 above. 

Let fi, fo,..., fig denote the basic moves (27/5 degree turns of a ”pen- 
tagon” about a vertex) of the Rubik isocahedron, regarded as elements of 
Sv, where V denotes the set of 12 vertices of the Rubik isocahedron (”rubi- 
con”). 

The following remarkable result is due to John Conway [CS]. 


Theorem 263 M,, =< x *y7! la,y € {fi,., fio} >- 


In other words, the Mathieu group Mj» is generated by the twist-untwist 
moves of the Rubik isocahedron. If we call a ”twist-untwist” move of the 
form x * y~' (with x,y as in the theorem above) a cross move then (with 
apologies to Caeser) the theorem above says that Mj is generated by the 
crosses of the rubicon. 

In fact, C acts 5-transitively on the set of vertices of the rubicon (this is 
implicit in [CS]). 
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13.7 An aside: A pair of cute facts 


It’s hard to resist stating some more interesting facts about the Mathieu 
groups. 


13.7.1 Hadamard matrices 


Let A = (aj; )1<i,;<n denote a real n Xn matrix. The following question seems 
quite natural in a course in advanced vector calulus or real analysis: 


Question: What is the maximum value of | det(A)|, where the entries of 
A range over all real numbers |a;;| < 1? 


From vector calculus we know that the absolute value of the determinant 
of a real square matrix equals the volume of the parallelpiped spanned by 
the row (or column) vectors of the matrix. The volume of a parallelpiped 
with sides of a fixed length depends on the angles the row vectors make with 
each other. This volume is maximized when the row vectors are mutually 
orthogonal, 1.e., when the parallelpiped is a cube in R”. Suppose now that 
the row vectors of A are all orthogonal. The row vectors of A, |a;;| < 1, 


are longest when each a;; = +1, which implies that the length of each row 
vector is \/n. Suppose, in addition, that the row vectors of A are all of 
length \/n. Such a matrix is called a Hadamard matrix of order n. Then 
| det(A)| = (\/n)" = n"/?, since the cube has n sides of length /n. Now, if 


Ais any matrix as in the above question then we must have | det(A)| <n”/?. 


This inequality is called Hadamard’s inequality. 


What is shocking at first (at least to me) is that, there does not always 
exist a Hadamard matrix. For example, there is a 2 x 2 Hadamard matrix 
but not a 3 x 3 one. What is perhaps even more suprising is that, in spite of 
the fact that the above question (which is unsolved) arose from an analytic 
perspective, Hadamard matrices are related more to coding theory, number 
theory, and combinatorics [vLW]! 
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Example 264 Let 


1 1 ol 1 1 41 1 1 1 1 41 1 
—-l 1 1 ol 1 ol 1 -l -1l -l 1 -l 
-l -l 1 21 -1 1 IT 1 -t -1 -1 1 
-l 1 -1 1 1 -l lt 1 1 -l -l -l 


-l -l 1 -1 1 1 -l 1 1 1 -l -l 
-l -l -l 1 -l 1 1 -l 1 1 1 -l 
1 -l 1 -l 1 -1 1 1 -1 1 1 ol 
—-l 1 

—-l 1 1 -l -1l -1 1 -1 1 1 -l 1 
—-l 1 

-l -l 1 1 1 -l -1 -1l 1 -1 1 
-l 1 -1 1 1 1 -l -1l -l 1 -!l 


A:= 


This is a Hadamard matric of order 12. 


Exercise 13.7.1 Show that 

(a) if you swap two rows or columns of a Hadamard matrix, you will get 
another Hadamard matrix, 

(b) if you multiply any row or column of a Hadamard matrix by —1, you 
will get another Hadamard matriz, 

(c) if you multiply any Hadamard matrix on the left by a signed permuta- 
tion matrix (that is, a matrix with exactly one +1 per row and column) then 


you will get another Hadamard matriz, 
(d) if you multiply any Hadamard matrix on the left by a signed permuta- 
tion matrix (that is, a matrix with exactly one +1 per row and column) then 


you will get another Hadamard matriz. 


Exercise 13.7.2 Let A,B be two Hadamard matrices of order n. Call A, 
B left equivalent if there is ann X n signed permutation matrix P such that 
A = PB. Show that this defines an equivalence relation on the set of all 
Hadamard matrices of order n. 


Exercise 13.7.3 Let A be a Hadamard matrix of ordern. Let Aut(A) denote 
the set of allnxn signed permutation matrices Q such that A is left equivalent 
to AQ. Show that Aut(A), called the automorphism group of A, is a group 
under matrix multiplication. 
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The following result is yet another indication of the unique role of these 
Mathieu groups in mathematics: 


Theorem 265 (M. Hall, Assmus-Mattson [AM]) Let A be the Hadamard 
matrix of order 12 in the above example. Then Aut(A) = Miz. 


13.7.2 5-transitivity 


The following result exemplifies once more the unique role of these Mathieu 
groups in group theory: 


Theorem 266 /[fG is a subgroup of Sx for some finite set X and if G acts 
5-transitively on X then exactly one of the following must be true: 

(a) G=S,,, for some n > 4, 

(b) G=A,,, for some m > 6, 

(c) G= My, 

(d) GS Mog. 
Furthermore, each of the groups in (a)-(d) acts 5-transitively on some finite 
set, 


For a proof of this, see [CS] and [R], ch 9. 
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Chapter 14 


Appendix: Some solution 
strategies 


”'The emphasis on mathematical methods seems to be shifted 
more towards combinatorics and set theory - and away from the 
algorithm of diferential equations which dominates mathematical 
physics.” 

J. von Neumann and O. Morganstern, THEORY OF GAMES 
AND ECOMONIC BEHAVIOR, 1944 


This chapter includes some strategies for solving the 3x3 Rubik’s cube, 
the 4x4 Rubik’s cube, the masterball, the equator puzzle, the skewb, and the 
pyraminx. For unexplained notation used in some of the sections below, see 
chapter 4. 

First (this is a mathematics course, after all!) we discuss some of the 
mathematical ideas behind the computer algorithms used to study the Ru- 
bik’s cube: 


14.1 The subgroup method 


One approach to solve the Rubik’s cube using a computer has been to con- 
struct a certain sequence of subgroups 


G, = {1} < Gn—1 Sou. < Gy < Go =G, 
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where G =< R,L,F,B,U,D > is the Rubik’s cube group, which allows the 
following strategy to be implemented: 


e represent a given position of the Rubik’s cube by an element go € G, 


e determine a complete set of coset representatives of Gy41/G,: 
Gra1/Gr = VE gee Gry, some r, > 1,V0<k <n 
(note Mp1 = 1,9n4 = 1), 


e (step 1) if go € gisGi (where i € {1,...,71}) then let g = g1, and 
9, = 91 Jo (note g € G1), 


e (inductive step) if g, € G;, has been defined and if g, © gr4ijGr 
(where 7 € {1,...,n,}) then let 941 = geoij and 944, = Tear Ik (note 
Jet € Gri) 


e putting all these together, we obtain 1 = g7'g7',g7'5...g7 go, 80 
Jo = 9192---Gn-19n- 


The hope is to be able to choose the sequence of subgroups G; in such a 
way that the coset representatives are short, relatively simple moves on the 
Rubik’s cube so that the ’solution” go = 9192---Gn—1Gn 18 not too long. 


14.1.1 Example: the corner-edge method 


We now present an example - a fairly unsophisticated one but you will get 
the idea. 

Let G denote the subgroup which does not permute any corners, let G'p 
denote the subgroup which does not permute any corners or edges, let G3 
denote the subgroup which does not permute any corners or edges and does 
not reorient any corners, and let Gy = {1}: 


G, = {1} < Gs < Go < Gy < Go = G. 


This choice of subgroups crudely models the ”corner-edge method” (see the 
appendix) due to Singmaster [Si]. 
The idea is simple. 
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1. Represent a given position of the Rubik’s cube by an element go € G. 


2. Let g, denote the move which moves all the corners into the correct 
positions (i.e., permutes them into the solved position and possibly 
twists them), so gj ‘go € Gy. Let gf = 9] ‘90. 


3. Let gz denote the move which moves all the edges into the correct 
positions (i.e., permutes them into the solved position and possibly 
reorients corners and edges) and leaves all other pieces unpermuted, so 
9295 © G2. Let g = 93 'H1- 


4. Let g3 denote the move which ” solves” all the corners (i.e., twists them 
all into the correct orientation and may flip some edges) but does not 
permute any pieces, so g3 ‘gi € G3. Let gs = 93 ‘95. 


5. Let g4 denote the move which ”solves” all the edges (i.e., flips them all 
into the correct orientation) and leaves all other facets along. 


6. The ’solution” is go = 91929394. 


14.1.2. Example: Thistlethwaite’s method 


Morwen Thistlethwaite (a knot-theorist now at the Univ. of Tennessee) 
developed one of the best subgroup methods for solving the cube [FS]. He 
takes 


G, =< R,L,F,B,U*,D? >, G. =< R,L, Fk’, B’,U*, D? >, 
Gs =< R?, L?, F?, B?,U?, D? >, G4 = {1}. 


G2 is isomorphic to the ”Rubik’s 3 x 3 x 2-domino” group. It’s order is 
(8!)? - 12, according to [FS], 87.6. G3 is the ”squares” group. It’s order is 
2! . 34, according to [FS], §7.6. 


He has shown (using a computer to help with some of the work) that 


e there is a complete set of coset representatives {g1; | 1 <7 < ni} of 
G/G, such that each g1,; is at most 7 moves long (and n; = 2048), 


e there is a complete set of coset representatives {go | 1 <a < ng} of 
G/G 2 such that each go; is at most 13 moves long (and nz = 1082565), 


e there is a complete set of coset representatives {gs3; | 1 <7 < n3} of 
G/G3 such that each g3,; is at most 15 moves long (and nz = 29400), 
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e there is a complete set of coset representatives {g4; | 1 <7 < ny} of 
G3/G4 such that each g4; is at most 17 moves long (and ny = 663552). 


Therefore, the Rubik’s cube can be solved in at most 7+ 13+ 15+17 = 52 


Moves. 


More recent improvement’s on this method have gotten this number down 
to fourty-something in the “quater-turn metric” I think (see [Lo] for details 
and recent updates). 


14.2. 3x3 Rubik’s cube 


Consider the group G =< R,L,U,D,F,B > of moves of the Rubik’s cube. 
The size of the group generated by these permutations is 43252003274489856000 = 
4.3 x 101°. 


14.2.1 Strategy for solving the cube 


ly! denote 


Let 29 = y7!* x * y denote conjugation and [x,y] = x * y * 27 
the commutator, for x,y group elements. 


Let Mp denote clockwise (with respect to right side) quarter turn of the 
middle slice parallel to the right side. 


The layer method solution strategy is composed of 3 stages: 
Stage 1: Solve the top face and top edges. 
Stage 2: Solve the middle edges (and bottom edges as best as possible). 
Stage 3: Solve the bottom corners (and bottom edges if necessary). 


The corner-edge method solution strategy is composed of 2 stages: 


Stage 1: Solve and then orient the corners. 
(The move U *« F *[R,U]® * F~! permutes (ubr,ufl)(uf,ul,ub,ur).) 
Stage 2: Solve the and then orient the edges. 


”clean” edge and corner moves: 
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M}*«U-! * My! *U? * Mp *U-! « MZ edge 3-cycle (uf,ul,ur) 
(Mp * U)? *U * (Mp! «US? *U flips the top edges uf, ub 
(R? *« U?) permutes (uf,ub)(fr,br) 
(MpxU)4 flips ub,ul and flips df,db 
(r-! * D? x Rx Bol *U? * BY ufr+, bld++ 
[R, U]? permutes (ufr,dfr)(ubr,ubl) 
PF? « 1? « U? * (FP? x £7)? * U? « 1? * F? | permutes (uf,ub)(ur,ul) 
(D? « R? * D? * (F? * R?)? x UY)? permutes (ufl,ubr)(dfr,dbl) 
(Mj *U * Mp x U7)? permutes (ufl,ubr)(ufr,ubl) 
[Re D* RLU] corner 3-cycle (brd,urb,ulb) 


These moves were compiled with help from the books [Si], [B], [Sn], and 
[Gap]. 


14.2.2 Catalog of 3 x 3 Rubik’s ’supercube” moves 


The supercube is the Rubik’s cube with each center facet marked with a 
short line through it and an adjoining edge. 

* (M2*U-'* Mp! *U?* MpxU—1 M2)? is the top edge 3-cycle (uf,ur,ul), 

* (Rol * D? x R* Bo! *U? * BY? twists the ufr corner clockwise and the 
bld corner counterclockwise (and does not twist any centers). 

* My! * M5! * Mp * U7! * My! * Mp * Mp * U is the center twist u+, 
r- (for these last three moves, see [Si], [Sn]) 


14.3. 4x 4 Rubik’s cube 


The solution strategy is composed of 3 stages: 

Stage 1: Solve the corners. For this moves for the 3 x 3 Rubik’s cube. 
Stage 2: ”Pair” the edges so that the neighboring facets on neighboring 
middle edges have the same color. For this the following ”clean edge moves” 
are useful: 


e flipedge: 
D3 *« Di * Uy * F3 * US * By « DD * D5 * Dy * U, * Lf} * U3 * Ly * U2) * LP 


(due to J. Adams [A] who calls it "move 8”). This flips and swaps the 
two middle edge facets on the UF boundary. It affects some centers, 
but no other edges or corners. 
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e upedgeswap: 
Ry * Bi * D* * B} * R3 * By * D} * B} * Ry * BP x RS 


(due to Thai [T], who calls it an "11 gram”). This move affects some 
centers but no corners and only 4 edge facets. It swaps and flips the 
right-most UF edge cubie with the left-most (with respect to the B 
face) UB edge subcube, sending the U facet of the right-most UF edge 
subcube to the B facet of the left-most UB edge subcube. 


e 3-cycle 
(R2 *K U,)° 
is a 3 cycle on the edges: (Rauf, Bygur, Boul). This doesn’t affect any 
other edges and leaves all corners fixed. 


Stage 3: Solve the edges. For this the ”clean edge moves” for the 3 x 3 
Rubik’s cube. 

Stage 4: If necessary, apply the flipedge move above, 

Stage 5: Solve the centers. For this, use the following ”clean center move”: 


center3cycle = Ry! * Fy * Ry! * Fy! * Ry * Fy * Ry * Fy" 


(also called "move 9”, due to J. Adams). This move is a 3-cycle on centers 
facets, affecting no edges, no corners, and no other center facets. It is the 
3-cycle (15 19 18) in the above notation. 

Some similar clean center moves: 


center] = Bi * R3 * Fy * Ry * Bi * R3* FP * Ro, 


center2 = R3 * By * Ry * Fy * Ry * By * R3 * Fy * BR} 


These aren’t really necessary since the the center3cycle can always be applied 
after a suitable set-up move (i.e., in combination with a suitable conjugation). 
The following move is occasionally useful: 


center swap = (Rj * U3)* 


This affects only 6 center facets (on the front and back faces) and no others. 
It is the product of two 3-cycles: (15 34 23)(27 14 22) in the above notation. 
These moves were compiled with help from the books [A] and [T]. 
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14.4 Rainbow masterball 


The solution strategy 
Step 1: The idea is to first get all the middle bands aligned first, so you 
get ball corresponding to a matrix of the form 


¥ EPR X 
¥ MD DO ¥ 
*¥ WW * 
* BoB *¥ 
¥FooN ¥ 
¥xDD ¥ 
¥ NIN * 
* © © * 


Here, * denotes any color. We have labeled the colors on the masterball as 
1,2,...,8 in order of occurrence. 

We describe a method, which I call ”fishing”, for achieving this. (Mathe- 
matically, this amounts to performing some carefully choosen commutators.) 
Without too much trouble you can always assume that we have one column 
aligned. You may need to flip or rotate the ball a little bit to do this. Call 
this aligned column ”column 1” and call the color in column 1, color 1”. 
We want to get the middle two entries in column 2 aligned. Call the color in 
the (2,3)-entry ”color 2”. 

We want to get color 2 in the (2,2)-entry. The remaining large color 2 
tile is what we will ’fish” for. Hold the ball in front of you in such a way 
that column 2 is slightly to the left of center and column 3 is slightly to the 
right of center. There are 4 facets in the right upper middle band, 4 facets 
in the left upper middle band, 4 facets in the right lower middle band and 4 
facets in the left lower middle band. A flip about the center on the right half 
(i.e., perform f2) exchanges these. We may assume that color 2 is on one of 
the four facets in the right lower middle band. (If it isn’t you need to apply 
fo first). Now perform ry! * fy! * ry * fo: first perform rz! (this is ”baiting 
the hook”), then f7' ("putting the hook in the water”), then rz ("setting 
the hook”), and finally fy (“reeling in the hook”). You may or may not have 
color 2 in the (2,2) place like you want but the color | stripe is intact. If 
necessary, try again. After at most 4 tries you'll be successful. 

Step 2: Repeat this ”crab fishing” strategy to get color 2 in the (1,2) 
position (using rj! * fy! *r, * fy in place of ry! * fy! * ry * fy). Now, by 
turning the ball over if necessary, repeat this idea to get color 2 in the (4,2) 
position. Now you have two ” aligned” stripes on your ball - color 1 in column 
1 and color 2 in column 2. We say, in this case, that columns | and 2 have 
been solved”. 
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Step 3: Repeat this for columns 3 and 4. 
Step 4: Use the moves in the ”catalog” below to finish the puzzle. 


(I believe the only moves needed are the “equator2swap36” and the “po- 
lar2swap36” below, along with suitable cleverly choosen “set-up moves”.) 


14.4.1 A catalog of rainbow moves 


Column moves 


We number the columns as 1,...,8. We will use a signed cycle notation to 
denote an action of a move on the columns of the masterball. 


Example 267 A move which switches the Ist and 3rd column but flips both 
of them over will be denoted by (1 3)_. 

A move which sends the 4th column to the 6th column, the 6th column to 
the 5th column, and switches the 2nd and 3rd column but flips both of them 
over will be denoted by (2 3)_(6 5 4). 


move cycle 
fi (1,4)-(2,3)- 
fo (2,5)-(3,4)- 
fs (3,6)-(4,5)- 
fa (4,7)(5, 6)- 
fs (5,8)_(6, 7)_ 
fe (1,6)-(7,8)- 
fr (2,7)-(1,8)- 
fs (3,8)-(1,2)- 
fit for fi (1, 2)-(3, 5) 
fir fox fi fo (5,4,3,2,1)- 
fi * fa * fi (1, 5)(2, 6) 
fo * fa * fo (2,3)-(6, 5, 4) 
fit far fi (1,7)(5, 6) 
fit fs* fi (5,8)-(6, 7)- 
fi * fs* fi (2,8)(3,4)- 
fs * fi * fs (1, 8)-(4, 3, 2) 
fo * fi * fo (1,3)(4,5)- 
fa * fi * fs (1,3)(4,8)- 
Is * fi * fo (1, 4)-(2, 3,8, 5)- 
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Finally, (f, * fo * fg * fa)? #1 * rg * 3 * r4 Swaps the 7,8 columns and leaves 
all the others fixed but flipped over. 


Some products of 2-cycles on the facets 


These are all based on an idea of Andrew Southern. The polar2swap and 
equator2swap were obtained by trying variations of some of Andrew’s moves 
on a MAPLE implementation of the masterball [J]. 

We number the facets in the i-th column, north-to-south, as 21, 722,23, 74 


(where 7 = 1,2,...,8). 


move cycle 
va=ry* faery, «rg * faery (41, 84)(44, 81) 
rertea ert (41, 81)(44, 84) 
frre feerclery* fyerz!* fy | (14, 84)(11, 81) 
polar2swap36 (11, 14)(31, 61) 
polar2swap18 (61, 64)(11, 81) 
equator2swap36 (12, 13)(32, 62) 
equator2swap18 (62, 63)(12, 82) 


where 
polar2swap36 = fyxry!*ryl*fiefoxrpery  *foxrgeforr] er ax foer ye frargerat fi 


(moreover, if you replace rz by rg both times in this move you get the same 
effect), 


polar2swapl8 = fyxrz!*ry *faxfaeriery *faerga faery rae faery * foargarat fi 


equator2swap36 = fi «rj: «rg * fy * foxrg arg fy 73 fo ary erg fear 3* fi *P 447 3% fy 
equator2swapl8 = fi ery erg fax fgergery *fpergefaxry #rgkfyerg*faer garg fi 


For further details on the rainbow puzzle, see [JS], [J]. 


14.5 Equator puzzle 


Solution strategy, in brief: 
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e First, ignore the orientation of the pieces. Just try to get the pieces in 
their correct position. One of the most remarkable properties of this 
puzzle is that GAP [Gap] is, in practice, very efficient at solving this 
part of the solution. (This is remarkable in view of the fact that GAP 
is not very good at solving the corresponding problem for the Rubik’s 
cube, for example, so there is no reason to expect it to be good at 
solving this problem.) 


e Second, once the pieces are in the correct position, they must be cor- 
rectly oriented by a catalog of node” moves designed for that purpose. 
Some ”node” moves are included below. 


Some notation: if x, y, and z are moves, let 


ket k a ky * Zz. 


[v.y,zJa=ut ey 
Example 268 Some moves discovered using GAP: 


0 [r7°arp°s rg] will swap the NP and SP, while rotating the NP by 90 de- 
grees in a counter clockwise direction. Moreover, it will fix the position 
of the piece labeled as 20 but will rotate it by 90 degrees clockwise. 


e To send 1 (the NP) to 4, 4 to 7 (the SP), 7 to 10, and 10 to 1: 
(1,4,7,10) =(ryl«rgl ary erg * (ry) ery!) erg erg erp) ery 
e The square of the previous move is easier to do: 
(1,7)(4, 10) =rpargary? *rz° 

e a 3-cycle about the NP and then a 3-cycle about the SP: 

(2,22, 12)(6,8,17) =ryp*rgeryp erg) *rg*eryp ery) ery 
e if rotgy denotes a 90° solid rotation, 

roto = (2,13, 12, 22)(3, 14, 11, 21)(5, 16,9, 19)(6, 17,8, 18) «rz, 


then the following move will exchange two pairs of facets neighboring 
the NP and two pairs of facets neighboring the SP: 


(2, 12)(6,8)(13, 22)(17, 18) = (ri*rgery ery! erg*ry! ery! ery) *rotgo* 


1 


-1,.-1 - -1 -1 
K(TyKTQHT, FIZ TAKEN FTZ *11) * Oty 
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e A single 2-cycle or 3-cycle cannot be achieved in the group generated 

by ™,72,13- 

We call a move of the form [r}*, r3”, r3”] a node move since it only affects 
the nodes (where the circles intersect). The table below records where each 
node goes as well as its effect on the orientation. The position entry in a 
block is the position the piece moves to. The angle entry, if any is the angle 
the piece gets rotated by. No angle entry means, of course, that the piece 
is not rotated. No position entry means that the piece was not moved (but 
may have been rotated). If a move has no effect on the position or the angle 
then we fill in the block with a ”-”. 

In the following table, NP, SP have been denoted by 1,7, resp., for brevity. 


move,piece 1 7 4 10 15 20 
m123 7,90cew | 1,90cew | 10,180 4,180 90ccw 90cw 
m132 7,180 1,180 90ccw 90cw 20,90cw | 15,90ccw 
m231 7,180 1,180 90cw 90ccew | 20,90cw | 15,90ccw 
m213 90cw 90cew | 10,90cw | 4,90ccw 20 15 
m312 90ccw 90cew | 10,90cw | 4,90ccw 20 15 
m321 7,90cew | 1,90cw | 10,180 4,180 90cw 90ccw 

A 180 180 180 180 - - 
m321? - - - - 180 180 
B - - 180 180 - - 
nl23 1,90cew | 7,90cew 10 4 20,90ccew | 15,90cw 
D - - 90cw 90ccw 90cw 90ccw 
where 
m123 = [rp "sry? r5], 
m132 = [rpesrg. ry], 
m231 = [yrs ry], 
m213 = [rr srs]; 


A = m123 * m132 * m213, 
B=Cxm32l? *C'/,C = (1 4)(7 10) = ry, 
n123 = [r72,r5°, 73], 
D= [rr srs). 
Remark 26 Let G denote the subgroup of S39 generated by ry, 172,73. Ac- 
cording to GAP [Gap], 


|G] = 21424936845312000 = 15!- 2!4 = (2.14...) - Lol. 
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The largest normal subgroup of G which is a power of 2 is of order 2". 
Furthermore, G appears to act transitively on the following 15 pairs of facets: 


B= {[1, 7], [2,8], (13, 18], [3, 9], [14, 19], [4, 10], [15, 20], [5, 11], 


[23, 27], [16,21], [25, 29], [6, 12], [24, 28], [17, 22], [26,30]}. 


Possibly G is isomorphic to a semidirect product of Sg = 5,5 acting on Cy'. 


14.6 The skewb 


14.6.1 Strategy 


The goal here is to collect enough moves to support the following solution 
strategy: fix the centers and solve the corners using ”clean corner moves” 
(i.e., moves which do not effect the centers). 

The basic moves are twists by 120 degrees clockwise about each of the 


six corners FRU,FLU,BRU,BLU,BDR,BDL,DFR,DEL. 


14.6.2 A catalog of skewb moves 


Thanks to J. Montague and G. Gomes for comments and corrections for the 
descriptions below, which were discovered with the help of a simulation of 


the skewb written for MAPLE [Jwww]. 
1. FRU * BLU * FRU is order 3. 


2. [FRU * FLU]? twists 6 corners clockwise by 120 degrees. The 2 corners 
not twisted are those opposite the FRU, FLU corners: the BDR, BDL 
corners. The centers are all fixed. (FRU *« FLU)? = (FLU * FRU)? 
rotates all the corners except for the bd corners. It does not permute 
any facets. 


3. The move, [FRU * BLU], fixes all the centers and the 2 ”opposite” 
corners: DFL, BDR. It twists the 3 corners FLU, BLU, and FRU. On 
the remaining 3 corners, it acts as the permutation (DF'R, BRU, BDL). 


4. FRU = BDL (actually, they are only equal up to a rotation of the entire 
cube). In general, a corner move is equal to the opposite corner move 
up to a rotation of the entire cube. 
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10. 


. (FRU * FLU)? «(BDL * BDR)° rotates all the corners except for the 


bu and the df corners. The uf corners are rotated clockwise and the 
bd corners counterclockwise. It does not permute any facets. (F-RU x 
FLU)’ «(BDL * BDR)? is the same move, but rotates in the opposite 
direction. 


. (FRU *« FLU)’ * (BDL * BDR)® rotates the corners as follows: 


e the uf corners counterclockwise, 
e the db corners counterclockwise, 
e the df corners clockwise, 


e the ub corners clockwise. 


It does not permute any facets. 


Let bottomspin = (F RU * FLU)’ * (BDL * BDR) *(DFR* DFL)’. 
This move rotates the 4 bottom corners (the df corners clockwise and 
the db counterclockwise). It does not permute any facets. 


. (BRU * FLU)? is a 5 cycle on the center facets (F,R,B,U,L). It fixes 


the bottom and does not affect any corners. (BLU * FRU)? is a 5 
cycle on the center facets. It fixes the bottom and does not affect any 
corners. 


. (BLU * FRU)? «(BRU * FLU)? is a product of 2 transpositions on the 


center facets, swapping front/back and up/right. It fixes the bottom 
and does not affect any corners. 


Let U denote the clockwise (with respect to the up face) rotation of 
entire cube by 90 degrees. Then bottomspin * U * bottomspin rotates 
but does not swap 2 corners (the DFR and BDL) and does not affect 
any other corners or faces. 


14.7 The pyraminx 


Assume that the tetrahedron is lying on a flat surface in front of you, with 
the triangle base pointing away from you. The corners are denoted L (left), 
R (right), U (wp), and B (back). 

Basic Moves: Let 


268 CHAPTER 14. APPENDIX: SOME SOLUTION STRATEGIES 


e L denote the 120 degree clockwise rotation of the 2-level subtetrahedron 
containing the left corner, 


e R denote the 120 degree clockwise rotation of the 2-level subtetrahedron 
containing the right corner, 


e U denote the 120 degree clockwise rotation of the 2-level subtetrahedron 
containing the up corner, 


e B denote the 120 degree clockwise rotation of the 2-level subtetrahedron 
containing the back corner. 


First, get the center” facets solved, then twist the corner tips to solve 
them and the center facets. Finally, to solve the edge facets, use the following 
moves (given in [EK]): 


e (R,U—'] is a 3-cycle of edge pieces on the URL face, 
e [R,U~'|*[R', L] is a flip of two edges (UR edge and UL edge) on the 


URL face. 


14.8 The megaminx 


The strategy here is the same as for the 3x3 Rubik’s cube: 
e place the corners correctly first (ignoring correct corner orientation), 
e place the edges correctly first (ignoring correct edge orientation), 
e twist the corners if necesary, 
e flip the edges if necessary. 


Moves useful for carrying out these steps are included in the following 
catalog. 
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14.8.1 Catalog of moves 


First, some notation. We label the faces fi, fo,..., f6 on top and label the 
bottom faces f7, fg, ..., fig as in chapter 4. The same notation is used to indi- 
cate the move of the megaminx given by rotating that face of the megaminx 
by 72 degrees clockwise. 


ef ef hehe ty eh heheh * fy * fie he = [A fe? swaps 
the f,.f3 and the fy. f¢ corners: (fi-fo-fs. fi-fs-fa)(f-fo-fes fo-fo-f7) 


em— fax fg forty foe fs forts forts |: fox frem®s fr tf swaps 
2 pairs of corners on the f, face: (fi. fo. fe, fi-fs-fa) ft. fo-fs, fi-fs-Se) 


e fix fox fy * fox fix fe * if, * fy! 3 cycle on corners and 3-cycle on 


edges (fi. fo. fe, fil-fr-Se, fo-te- Sr) h-Sa, fet, fate) 


0 M2= fet fox fix fy * fr * te * fy) * fl) * fy! * fi ® fo fs (Mark 
Longridge) edge 3-cycle (fi. fo, fo-fs, fo-fe) 


e (fg! * fy! * fy! * fe * fo * fz)® - triple corner twister, ccw twists of 


Si-fo-fs, fi-fa-Se, fi-fs-fo 


© M3 = f\—2)* f2% fox fri fox fir fo fox fiw fare feo fy fel 
fix fy! * fs? * f * fr (Mark Longridge) edge 2-flip of fi. fo, fi-fe 


e M3a = fg! * fy! * fi* fo! * fy! * fa * fo * fo * fa * fox fi! * fo * fi * 
fg « fy’ * fy' (Mark Longridge) edge 2-flip of fi. fe, fi-fs 


For further details, see the internet sites [J] or [Lo]. 
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